Fine-Grained Image-Text Alignment in Medical Imaging Enables Cyclic Image-Report Generation. (arXiv:2312.08078v4 [cs.CV] UPDATED) 

SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models. (arXiv:2312.07492v4 [cs.CL] UPDATED) 

Steering Llama 2 via Contrastive Activation Addition. (arXiv:2312.06681v2 [cs.CL] UPDATED) 

Assessing AI Chatbots Performance in Comprehensive Standardized Test Preparation; A Case Study with GRE. (arXiv:2312.03719v3 [cs.CL] UPDATED) 

StyleCap: Automatic Speaking-Style Captioning from Speech Based on Speech and Language Self-supervised Learning Models. (arXiv:2311.16509v2 [cs.CL] UPDATED) 

A Baseline Analysis of Reward Models' Ability To Accurately Analyze Foundation Models Under Distribution Shift. (arXiv:2311.14743v6 [cs.CL] UPDATED) 

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts. (arXiv:2310.14628v2 [cs.CL] UPDATED) 

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis. (arXiv:2310.10477v3 [cs.CL] UPDATED) 

A Comprehensive Evaluation of Tool-Assisted Generation Strategies. (arXiv:2310.10062v2 [cs.CL] UPDATED) 

Rethinking Relation Classification with Graph Meaning Representations. (arXiv:2310.09772v2 [cs.CL] UPDATED) 

Can We Edit Multimodal Large Language Models? (arXiv:2310.08475v4 [cs.CL] UPDATED)

Augmenting conformers with structured state-space sequence models for online speech recognition. (arXiv:2309.08551v2 [cs.CL] UPDATED) 

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. (arXiv:2309.08140v2 [eess.AS] UPDATED) 

CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders. (arXiv:2309.07707v2 [cs.CL] UPDATED) 

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models. (arXiv:2309.02772v3 [cs.SE] UPDATED) 

Audio Generation with Multiple Conditional Diffusion Model. (arXiv:2308.11940v4 [cs.SD] UPDATED) 

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data. (arXiv:2308.10253v2 [cs.CV] UPDATED) 

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue. (arXiv:2308.03549v3 [cs.CL] UPDATED) 

EnrichEvent: Enriching Social Data with Contextual Information for Emerging Event Extraction. (arXiv:2307.16082v4 [cs.CL] UPDATED) 

A Comprehensive Overview of Large Language Models. (arXiv:2307.06435v7 [cs.CL] UPDATED) 
