Show newer

Self-Influence Guided Data Reweighting for Language Model Pre-training. (arXiv:2311.00913v1 [cs.CL]) 

Re-weighting Tokens: A Simple and Effective Active Learning Strategy for Named Entity Recognition. (arXiv:2311.00906v1 [cs.CL]) 

On The Open Prompt Challenge In Conditional Audio Generation. (arXiv:2311.00897v1 [cs.SD]) 

In-Context Prompt Editing For Conditional Audio Generation. (arXiv:2311.00895v1 [cs.SD]) 

Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models. (arXiv:2311.00871v1 [cs.LG]) 

Automatic Disfluency Detection from Untranscribed Speech. (arXiv:2311.00867v1 [eess.AS]) 

Training Dynamics of Contextual N-Grams in Language Models. (arXiv:2311.00863v1 [cs.LG]) 

Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing. (arXiv:2311.00835v1 [cs.CL]) 

Construction Artifacts in Metaphor Identification Datasets. (arXiv:2311.00790v1 [cs.CL]) 

Language Model Training Paradigms for Clinical Feature Embeddings. (arXiv:2311.00768v1 [cs.LG]) 

Challenges for Linguistically-Driven Computer-Based Sign Recognition from Continuous Signing for American Sign Language. (arXiv:2311.00762v1 [cs.CV]) 

Can Large Language Models Design Accurate Label Functions?. (arXiv:2311.00739v1 [cs.CL]) 

tmn at #SMM4H 2023: Comparing Text Preprocessing Techniques for Detecting Tweets Self-reporting a COVID-19 Diagnosis. (arXiv:2311.00732v1 [cs.CL]) 

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations. (arXiv:2310.20246v2 [cs.CL] UPDATED) 

Combining Language Models For Specialized Domains: A Colorful Approach. (arXiv:2310.19708v3 [cs.CL] UPDATED) 

InfoEntropy Loss to Mitigate Bias of Learning Difficulties for Generative Language Models. (arXiv:2310.19531v2 [cs.CL] UPDATED) 

Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs. (arXiv:2310.19347v2 [cs.CL] UPDATED) 

CXR-LLaVA: Multimodal Large Language Model for Interpreting Chest X-ray Images. (arXiv:2310.18341v2 [cs.CL] UPDATED) 

Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General Healthcare. (arXiv:2310.17956v2 [cs.CV] UPDATED) 

CodeFusion: A Pre-trained Diffusion Model for Code Generation. (arXiv:2310.17680v3 [cs.SE] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.