Show newer

Relative Value Biases in Large Language Models. (arXiv:2401.14530v1 [cs.CL]) 

MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms. (arXiv:2401.14526v1 [cs.CL]) 

Evaluating GPT-3.5's Awareness and Summarization Abilities for European Constitutional Texts with Shared Topics. (arXiv:2401.14524v1 [cs.CL]) 

Empathy and the Right to Be an Exception: What LLMs Can and Cannot Do. (arXiv:2401.14523v1 [cs.CY]) 

K-QA: A Real-World Medical Q&A Benchmark. (arXiv:2401.14493v1 [cs.CL]) 

LongHealth: A Question Answering Benchmark with Long Clinical Documents. (arXiv:2401.14490v1 [cs.CL]) 

Wordflow: Social Prompt Engineering for Large Language Models. (arXiv:2401.14447v1 [cs.HC]) 

Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models. (arXiv:2401.14440v1 [cs.CL]) 

Instructional Fingerprinting of Large Language Models. (arXiv:2401.12255v1 [cs.CR] CROSS LISTED) 

MM-LLMs: Recent Advances in MultiModal Large Language Models. (arXiv:2401.13601v2 [cs.CL] UPDATED) 

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation. (arXiv:2401.13527v2 [cs.CL] UPDATED) 

What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition. (arXiv:2401.12756v2 [cs.CL] UPDATED) 

Energy-based Automated Model Evaluation. (arXiv:2401.12689v2 [cs.LG] UPDATED) 

BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models. (arXiv:2401.12522v2 [cs.CL] UPDATED) 

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences. (arXiv:2401.10529v2 [cs.CV] UPDATED) 

Top in Chinese Data Processing: English Code Models. (arXiv:2401.10286v2 [cs.CL] UPDATED) 

Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models. (arXiv:2401.08491v2 [cs.CL] UPDATED) 

TrustLLM: Trustworthiness in Large Language Models. (arXiv:2401.05561v3 [cs.CL] UPDATED) 

Can AI Be as Creative as Humans?. (arXiv:2401.01623v4 [cs.AI] UPDATED) 

Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation. (arXiv:2312.15643v2 [cs.AI] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.