Show newer

Cross Modal Global Local Representation Learning from Radiology Reports and X-Ray Chest Images. (arXiv:2301.10951v1 [cs.CV]) 

Affective Faces for Goal-Driven Dyadic Communication. (arXiv:2301.10939v1 [cs.CV]) 

Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning. (arXiv:2301.10915v1 [cs.CL]) 

Causal Reasoning of Entities and Events in Procedural Texts. (arXiv:2301.10896v1 [cs.CL]) 

Improving Text-based Early Prediction by Distillation from Privileged Time-Series Text. (arXiv:2301.10887v1 [cs.CL]) 

Break It Down: Evidence for Structural Compositionality in Neural Networks. (arXiv:2301.10884v1 [cs.CL]) 

Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content. (arXiv:2301.10871v1 [cs.LG]) 

Partial Mobilization: Tracking Multilingual Information Flows Amongst Russian Media Outlets and Telegram. (arXiv:2301.10856v1 [cs.CY]) 

On the inconsistency of separable losses for structured prediction. (arXiv:2301.10810v1 [cs.LG]) 

Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering. (arXiv:2301.10799v1 [cs.CL]) 

Ontology-enhanced Prompt-tuning for Few-shot Learning. (arXiv:2201.11332v1 [cs.CL] CROSS LISTED) 

Document-level Relation Extraction as Semantic Segmentation. (arXiv:2106.03618v2 [cs.CL] CROSS LISTED) 

ViHOS: Hate Speech Spans Detection for Vietnamese. (arXiv:2301.10186v2 [cs.CL] UPDATED) 

Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study. (arXiv:2301.09099v2 [cs.CL] UPDATED) 

Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models. (arXiv:2212.10561v2 [cs.CL] UPDATED) 

Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task. (arXiv:2210.13382v3 [cs.LG] UPDATED) 

Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought. (arXiv:2210.01240v3 [cs.CL] UPDATED) 

Review of Natural Language Processing in Pharmacology. (arXiv:2208.10228v2 [cs.CL] UPDATED) 

VAuLT: Augmenting the Vision-and-Language Transformer for Sentiment Classification on Social Media. (arXiv:2208.09021v3 [cs.CV] UPDATED) 

BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning. (arXiv:2206.08657v3 [cs.CV] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.