Don't throw away your value model! Making PPO even better via Value-Guided Monte-Carlo Tree Search decoding. (arXiv:2309.15028v2 [cs.CL] UPDATED)
Foundation Metrics: Quantifying Effectiveness of Healthcare Conversations powered by Generative AI. (arXiv:2309.12444v2 [cs.CL] UPDATED)
Comparative Performance Evaluation of Large Language Models for Extracting Molecular Interactions and Pathway Knowledge. (arXiv:2307.08813v2 [cs.CL] UPDATED)
Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition. (arXiv:2307.07280v2 [cs.CL] UPDATED)
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. (arXiv:2306.15895v2 [cs.CL] UPDATED)
PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts. (arXiv:2306.04528v4 [cs.CL] UPDATED)
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey. (arXiv:2305.18703v6 [cs.CL] UPDATED)
Difference-Masking: Choosing What to Mask in Continued Pretraining. (arXiv:2305.14577v2 [cs.LG] UPDATED)
Question Answering as Programming for Solving Time-Sensitive Questions. (arXiv:2305.14221v2 [cs.CL] UPDATED)
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale. (arXiv:2305.14124v2 [cs.CL] UPDATED)
MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation. (arXiv:2305.12733v2 [cs.CL] UPDATED)
Multilingual Simplification of Medical Texts. (arXiv:2305.12532v4 [cs.CL] UPDATED)
Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge. (arXiv:2305.12212v2 [cs.CL] UPDATED)
Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization. (arXiv:2305.12169v2 [cs.CL] UPDATED)
Revisiting Entropy Rate Constancy in Text. (arXiv:2305.12084v2 [cs.CL] UPDATED)
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews. (arXiv:2305.11828v3 [cs.CL] UPDATED)
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate. (arXiv:2305.11595v3 [cs.CL] UPDATED)
PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning. (arXiv:2305.11508v2 [cs.CL] UPDATED)
Cross-modality Data Augmentation for End-to-End Sign Language Translation. (arXiv:2305.11096v3 [cs.CL] UPDATED)
Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks. (arXiv:2305.10160v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl