Connecting Speech Encoder and Large Language Model for ASR. (arXiv:2309.13963v2 [eess.AS] UPDATED) 

MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models. (arXiv:2309.13079v2 [cs.CL] UPDATED) 

Multimodal Deep Learning for Scientific Imaging Interpretation. (arXiv:2309.12460v2 [cs.LG] UPDATED) 

MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods. (arXiv:2309.10966v2 [cs.CL] UPDATED) 

Investigating the Catastrophic Forgetting in Multimodal Large Language Models. (arXiv:2309.10313v2 [cs.CL] UPDATED) 

Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations. (arXiv:2307.06576v5 [cs.IR] UPDATED) 

MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation. (arXiv:2306.10322v2 [cs.CV] UPDATED) 

Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias. (arXiv:2305.19894v2 [cs.CL] UPDATED) 

Weakly-Supervised Visual-Textual Grounding with Semantic Prior Refinement. (arXiv:2305.10913v2 [cs.CV] UPDATED) 

How to Index Item IDs for Recommendation Foundation Models. (arXiv:2305.06569v6 [cs.IR] UPDATED) 

Disentangling Prosody Representations with Unsupervised Speech Reconstruction. (arXiv:2212.06972v2 [cs.SD] UPDATED) 

Permutation invariant matrix statistics and computational language tasks. (arXiv:2202.06829v2 [cs.CL] UPDATED) 

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models. (arXiv:2309.15098v1 [cs.CL]) 

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning. (arXiv:2309.15091v1 [cs.CV]) 

RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models. (arXiv:2309.15088v1 [cs.IR]) 

Natural Language based Context Modeling and Reasoning with LLMs: A Tutorial. (arXiv:2309.15074v1 [cs.CL]) 

Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding. (arXiv:2309.15028v1 [cs.CL]) 

Large Language Model Alignment: A Survey. (arXiv:2309.15025v1 [cs.CL]) 

Question-Answering Approach to Evaluate Legal Summaries. (arXiv:2309.15016v1 [cs.CL]) 

Updated Corpora and Benchmarks for Long-Form Speech Recognition. (arXiv:2309.15013v1 [cs.CL]) 
