Show newer

Enhancing Abstractiveness of Summarization Models through Calibrated Distillation. (arXiv:2310.13760v2 [cs.CL] UPDATED) 

Bias in Emotion Recognition with ChatGPT. (arXiv:2310.11753v2 [cs.RO] UPDATED) 

Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis. (arXiv:2310.09909v3 [cs.CV] UPDATED) 

"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters. (arXiv:2310.09219v5 [cs.CL] UPDATED) 

Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors. (arXiv:2310.02980v2 [cs.LG] UPDATED) 

ARN: A Comprehensive Framework and Benchmark for Analogical Reasoning on Narratives. (arXiv:2310.00996v2 [cs.CL] UPDATED) 

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond. (arXiv:2309.16583v4 [cs.CL] UPDATED) 

How much can ChatGPT really help Computational Biologists in Programming?. (arXiv:2309.09126v2 [cs.AI] UPDATED) 

Synthetic Text Generation using Hypergraph Representations. (arXiv:2309.06550v2 [cs.CL] UPDATED) 

FIND: A Function Description Benchmark for Evaluating Interpretability Methods. (arXiv:2309.03886v2 [cs.CL] UPDATED) 

BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge. (arXiv:2308.16458v4 [cs.LG] UPDATED) 

When Do Program-of-Thoughts Work for Reasoning?. (arXiv:2308.15452v5 [cs.CL] UPDATED) 

NLP-based detection of systematic anomalies among the narratives of consumer complaints. (arXiv:2308.11138v2 [stat.ME] UPDATED) 

Token-Scaled Logit Distillation for Ternary Weight Generative Language Models. (arXiv:2308.06744v4 [cs.CL] UPDATED) 

WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine. (arXiv:2308.05361v3 [cs.CL] UPDATED) 

Bengali Fake Reviews: A Benchmark Dataset and Detection System. (arXiv:2308.01987v2 [cs.CL] UPDATED) 

Joint Prompt Optimization of Stacked LLMs using Variational Inference. (arXiv:2306.12509v2 [cs.CL] UPDATED) 

RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model. (arXiv:2306.11300v3 [cs.CV] UPDATED) 

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition. (arXiv:2306.07848v10 [cs.CL] UPDATED) 

Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex). (arXiv:2306.03997v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.