StyleCap: Automatic Speaking-Style Captioning from Speech Based on Speech and Language Self-supervised Learning Models. (arXiv:2311.16509v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2311.16509 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

A Baseline Analysis of Reward Models' Ability To Accurately Analyze Foundation Models Under Distribution Shift. (arXiv:2311.14743v6 [cs.CL] UPDATED)

http://arxiv.org/abs/2311.14743 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts. (arXiv:2310.14628v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2310.14628 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis. (arXiv:2310.10477v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2310.10477 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

A Comprehensive Evaluation of Tool-Assisted Generation Strategies. (arXiv:2310.10062v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2310.10062 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Rethinking Relation Classification with Graph Meaning Representations. (arXiv:2310.09772v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2310.09772 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Can We Edit Multimodal Large Language Models?. (arXiv:2310.08475v4 [cs.CL] UPDATED)

http://arxiv.org/abs/2310.08475 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Augmenting conformers with structured state-space sequence models for online speech recognition. (arXiv:2309.08551v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2309.08551 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. (arXiv:2309.08140v2 [eess.AS] UPDATED)

http://arxiv.org/abs/2309.08140 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders. (arXiv:2309.07707v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2309.07707 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models. (arXiv:2309.02772v3 [cs.SE] UPDATED)

http://arxiv.org/abs/2309.02772 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Audio Generation with Multiple Conditional Diffusion Model. (arXiv:2308.11940v4 [cs.SD] UPDATED)

http://arxiv.org/abs/2308.11940 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data. (arXiv:2308.10253v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2308.10253 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Dec 31, 2023, 03:19

Dec 31, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue. (arXiv:2308.03549v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2308.03549 #arXiv #NLProc