Max-Margin Token Selection in Attention Mechanism. (arXiv:2306.13596v2 [cs.LG] UPDATED)
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation. (arXiv:2306.13460v2 [cs.CL] UPDATED)
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models. (arXiv:2306.13230v2 [cs.CL] UPDATED)
Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models. (arXiv:2306.08952v2 [cs.CL] UPDATED)
Survey on Sociodemographic Bias in Natural Language Processing. (arXiv:2306.08158v2 [cs.CL] UPDATED)
LLMZip: Lossless Text Compression using Large Language Models. (arXiv:2306.04050v2 [cs.IT] UPDATED)
Language Models are Bounded Pragmatic Speakers. (arXiv:2305.17760v3 [cs.CL] UPDATED)
Cross-Attention is Not Enough: Incongruity-Aware Hierarchical Multimodal Sentiment Analysis and Emotion Recognition. (arXiv:2305.13583v2 [cs.CL] UPDATED)
Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test. (arXiv:2305.13108v3 [eess.AS] UPDATED)
Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling. (arXiv:2305.11543v2 [cs.CL] UPDATED)
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers. (arXiv:2305.07011v2 [cs.CV] UPDATED)
mCPT at SemEval-2023 Task 3: Multilingual Label-Aware Contrastive Pre-Training of Transformers for Few- and Zero-shot Framing Detection. (arXiv:2303.09901v2 [cs.CL] UPDATED)
Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering. (arXiv:2303.05352v2 [cs.CL] UPDATED)
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks. (arXiv:2302.13939v4 [cs.CL] UPDATED)
Auditing large language models: a three-layered approach. (arXiv:2302.08500v2 [cs.CL] UPDATED)
Mu²SLAM: Multitask, Multilingual Speech and Language Models. (arXiv:2212.09553v2 [cs.CL] UPDATED)
WACO: Word-Aligned Contrastive Learning for Speech Translation. (arXiv:2212.09359v2 [cs.CL] UPDATED)
Imagination is All You Need! Curved Contrastive Learning for Abstract Sequence Modeling Utilized on Long Short-Term Dialogue Planning. (arXiv:2211.07591v2 [cs.CL] UPDATED)
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. (arXiv:2211.05100v4 [cs.CL] UPDATED)
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control. (arXiv:2210.17432v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl