Show newer

Are Diffusion Models Vision-And-Language Reasoners?. (arXiv:2305.16397v1 [cs.CV]) 

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer. (arXiv:2305.16380v1 [cs.CL]) 

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition. (arXiv:2305.16371v1 [cs.CL]) 

Role-Play with Large Language Models. (arXiv:2305.16367v1 [cs.CL]) 

Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving. (arXiv:2305.16366v1 [cs.CL]) 

EDM3: Event Detection as Multi-task Text Generation. (arXiv:2305.16357v1 [cs.CL]) 

PandaGPT: One Model To Instruction-Follow Them All. (arXiv:2305.16355v1 [cs.CL]) 

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. (arXiv:2305.16353v1 [cs.SD]) 

Lexinvariant Language Models. (arXiv:2305.16349v1 [cs.CL]) 

Leveraging LLMs for KPIs Retrieval from Hybrid Long-Document: A Comprehensive Framework and Dataset. (arXiv:2305.16344v1 [cs.CL]) 

A Distributed Automatic Domain-Specific Multi-Word Term Recognition Architecture using Spark Ecosystem. (arXiv:2305.16343v1 [cs.CL]) 

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition. (arXiv:2305.16342v1 [cs.CL]) 

Segmented Recurrent Transformer: An Efficient Sequence-to-Sequence Model. (arXiv:2305.16340v1 [cs.CL]) 

Don't Trust GPT When Your Question Is Not In English. (arXiv:2305.16339v1 [cs.CL]) 

Think Before You Act: Decision Transformers with Internal Working Memory. (arXiv:2305.16338v1 [cs.LG]) 

Handling Realistic Label Noise in BERT Text Classification. (arXiv:2305.16337v1 [cs.CL]) 

Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering. (arXiv:2305.16335v1 [cs.CL]) 

OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities. (arXiv:2305.16334v1 [cs.CL]) 

Text Generation with Speech Synthesis for ASR Data Augmentation. (arXiv:2305.16333v1 [cs.CL]) 

Semantic Composition in Visually Grounded Language Models. (arXiv:2305.16328v1 [cs.CL]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.