Chain of Code: Reasoning with a Language Model-Augmented Code Emulator. (arXiv:2312.04474v2 [cs.CL] UPDATED)
Beyond Surface: Probing LLaMA Across Scales and Layers. (arXiv:2312.04333v2 [cs.CL] UPDATED)
Methods to Estimate Large Language Model Confidence. (arXiv:2312.03733v2 [cs.CL] UPDATED)
Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability. (arXiv:2312.03721v2 [cs.CL] UPDATED)
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise. (arXiv:2312.01523v2 [cs.CL] UPDATED)
Knowledge Unlearning for LLMs: Tasks, Methods, and Challenges. (arXiv:2311.15766v2 [cs.CL] UPDATED)
ÚFAL CorPipe at CRAC 2023: Larger Context Improves Multilingual Coreference Resolution. (arXiv:2311.14391v2 [cs.CL] UPDATED)
LM-Cocktail: Resilient Tuning of Language Models via Model Merging. (arXiv:2311.13534v4 [cs.CL] UPDATED)
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4. (arXiv:2311.07361v2 [cs.CL] UPDATED)
Quality-Diversity through AI Feedback. (arXiv:2310.13032v4 [cs.CL] UPDATED)
Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly. (arXiv:2310.12300v2 [cs.CL] UPDATED)
DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations. (arXiv:2310.11374v2 [cs.CL] UPDATED)
Conversational Health Agents: A Personalized LLM-Powered Agent Framework. (arXiv:2310.02374v3 [cs.CL] UPDATED)
Spider4SPARQL: A Complex Benchmark for Evaluating Knowledge Graph Question Answering Systems. (arXiv:2309.16248v2 [cs.CL] UPDATED)
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM. (arXiv:2309.14348v2 [cs.CL] UPDATED)
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs. (arXiv:2309.11830v2 [cs.CL] UPDATED)
FIND: A Function Description Benchmark for Evaluating Interpretability Methods. (arXiv:2309.03886v3 [cs.CL] UPDATED)
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook. (arXiv:2307.16680v5 [cs.LG] UPDATED)
Max-Margin Token Selection in Attention Mechanism. (arXiv:2306.13596v4 [cs.LG] UPDATED)
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark. (arXiv:2305.14938v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl