Using Large Language Model Annotations for Valid Downstream Statistical Inference in Social Science: Design-Based Semi-Supervised Learning. (arXiv:2306.04746v1 [stat.ME])

http://arxiv.org/abs/2306.04746 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems. (arXiv:2306.04743v1 [cs.DB])

http://arxiv.org/abs/2306.04743 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

Soft-prompt Tuning for Large Language Models to Evaluate Bias. (arXiv:2306.04735v1 [cs.CL])

http://arxiv.org/abs/2306.04735 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation. (arXiv:2306.04724v1 [cs.CL])

http://arxiv.org/abs/2306.04724 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts. (arXiv:2306.04723v1 [cs.CL])

http://arxiv.org/abs/2306.04723 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

Improving Open Language Models by Learning from Organic Interactions. (arXiv:2306.04707v1 [cs.CL])

http://arxiv.org/abs/2306.04707 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models. (arXiv:2306.04695v1 [cs.CV])

http://arxiv.org/abs/2306.04695 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 11, 2023, 03:26

Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge. (arXiv:2306.04657v1 [cs.CL])

http://arxiv.org/abs/2306.04657 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

Improving Conversational Recommendation Systems via Counterfactual Data Simulation. (arXiv:2306.02842v1 [cs.CL] CROSS LISTED)

http://arxiv.org/abs/2306.02842 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning. (arXiv:2306.04387v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2306.04387 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

GPT Self-Supervision for a Better Data Annotator. (arXiv:2306.04349v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2306.04349 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset. (arXiv:2306.03030v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2306.03030 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

UNIDECOR: A Unified Deception Corpus for Cross-Corpus Deception Detection. (arXiv:2306.02827v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2306.02827 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models. (arXiv:2306.01506v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2306.01506 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

An Empirical Study on Challenging Math Problem Solving with GPT-4. (arXiv:2306.01337v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2306.01337 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

Supplementary Features of BiLSTM for Enhanced Sequence Labeling. (arXiv:2305.19928v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2305.19928 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Jun 10, 2023, 03:18

A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets. (arXiv:2305.18486v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2305.18486 #arXiv #NLProc

All recent Computation and Language articles on arXiv.org for the Fediverse

Inspired by https://twitter.com/arxiv_cscl

Joined Nov 2022