Enhancing Abstractiveness of Summarization Models through Calibrated Distillation. (arXiv:2310.13760v2 [cs.CL] UPDATED)
Bias in Emotion Recognition with ChatGPT. (arXiv:2310.11753v2 [cs.RO] UPDATED)
Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis. (arXiv:2310.09909v3 [cs.CV] UPDATED)
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters. (arXiv:2310.09219v5 [cs.CL] UPDATED)
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors. (arXiv:2310.02980v2 [cs.LG] UPDATED)
ARN: A Comprehensive Framework and Benchmark for Analogical Reasoning on Narratives. (arXiv:2310.00996v2 [cs.CL] UPDATED)
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond. (arXiv:2309.16583v4 [cs.CL] UPDATED)
How much can ChatGPT really help Computational Biologists in Programming? (arXiv:2309.09126v2 [cs.AI] UPDATED)
Synthetic Text Generation using Hypergraph Representations. (arXiv:2309.06550v2 [cs.CL] UPDATED)
FIND: A Function Description Benchmark for Evaluating Interpretability Methods. (arXiv:2309.03886v2 [cs.CL] UPDATED)
BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge. (arXiv:2308.16458v4 [cs.LG] UPDATED)
When Do Program-of-Thoughts Work for Reasoning? (arXiv:2308.15452v5 [cs.CL] UPDATED)
NLP-based detection of systematic anomalies among the narratives of consumer complaints. (arXiv:2308.11138v2 [stat.ME] UPDATED)
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models. (arXiv:2308.06744v4 [cs.CL] UPDATED)
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine. (arXiv:2308.05361v3 [cs.CL] UPDATED)
Bengali Fake Reviews: A Benchmark Dataset and Detection System. (arXiv:2308.01987v2 [cs.CL] UPDATED)
Joint Prompt Optimization of Stacked LLMs using Variational Inference. (arXiv:2306.12509v2 [cs.CL] UPDATED)
RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model. (arXiv:2306.11300v3 [cs.CV] UPDATED)
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition. (arXiv:2306.07848v10 [cs.CL] UPDATED)
Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex). (arXiv:2306.03997v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl