Show newer

Max-Margin Token Selection in Attention Mechanism. (arXiv:2306.13596v2 [cs.LG] UPDATED) 

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation. (arXiv:2306.13460v2 [cs.CL] UPDATED) 

DiversiGATE: A Comprehensive Framework for Reliable Large Language Models. (arXiv:2306.13230v2 [cs.CL] UPDATED) 

Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models. (arXiv:2306.08952v2 [cs.CL] UPDATED) 

Survey on Sociodemographic Bias in Natural Language Processing. (arXiv:2306.08158v2 [cs.CL] UPDATED) 

LLMZip: Lossless Text Compression using Large Language Models. (arXiv:2306.04050v2 [cs.IT] UPDATED) 

Language Models are Bounded Pragmatic Speakers. (arXiv:2305.17760v3 [cs.CL] UPDATED) 

Cross-Attention is Not Enough: Incongruity-Aware Hierarchical Multimodal Sentiment Analysis and Emotion Recognition. (arXiv:2305.13583v2 [cs.CL] UPDATED) 

Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test. (arXiv:2305.13108v3 [eess.AS] UPDATED) 

Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling. (arXiv:2305.11543v2 [cs.CL] UPDATED) 

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers. (arXiv:2305.07011v2 [cs.CV] UPDATED) 

mCPT at SemEval-2023 Task 3: Multilingual Label-Aware Contrastive Pre-Training of Transformers for Few- and Zero-shot Framing Detection. (arXiv:2303.09901v2 [cs.CL] UPDATED) 

Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering. (arXiv:2303.05352v2 [cs.CL] UPDATED) 

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks. (arXiv:2302.13939v4 [cs.CL] UPDATED) 

Auditing large language models: a three-layered approach. (arXiv:2302.08500v2 [cs.CL] UPDATED) 

Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models. (arXiv:2212.09553v2 [cs.CL] UPDATED) 

WACO: Word-Aligned Contrastive Learning for Speech Translation. (arXiv:2212.09359v2 [cs.CL] UPDATED) 

Imagination is All You Need! Curved Contrastive Learning for Abstract Sequence Modeling Utilized on Long Short-Term Dialogue Planning. (arXiv:2211.07591v2 [cs.CL] UPDATED) 

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. (arXiv:2211.05100v4 [cs.CL] UPDATED) 

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control. (arXiv:2210.17432v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.