Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding. (arXiv:2306.02858v4 [cs.CL] UPDATED) 

Training Priors Predict Text-To-Image Model Performance. (arXiv:2306.01755v2 [cs.CV] UPDATED) 

Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction. (arXiv:2306.01439v2 [cs.LG] UPDATED) 

Natural Language Decompositions of Implicit Content Enable Better Text Representations. (arXiv:2305.14583v2 [cs.CL] UPDATED) 

Is a Prestigious Job the same as a Prestigious Country? A Case Study on Multilingual Sentence Embeddings and European Countries. (arXiv:2305.14482v2 [cs.CL] UPDATED) 

Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach. (arXiv:2305.14410v2 [cs.CV] UPDATED) 

Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality. (arXiv:2305.13812v3 [cs.CL] UPDATED) 

Asking Clarification Questions to Handle Ambiguity in Open-Domain QA. (arXiv:2305.13808v2 [cs.CL] UPDATED) 

A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why? (arXiv:2305.12920v3 [cs.CL] UPDATED)

Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation. (arXiv:2305.12786v2 [cs.CL] UPDATED) 

TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks. (arXiv:2305.11430v2 [cs.AI] UPDATED) 

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining. (arXiv:2305.10429v3 [cs.CL] UPDATED) 

FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy. (arXiv:2305.10307v4 [cs.CL] UPDATED) 

StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure. (arXiv:2305.05588v2 [cs.CL] UPDATED) 

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs. (arXiv:2304.08244v2 [cs.CL] UPDATED) 

Goal Driven Discovery of Distributional Differences via Language Descriptions. (arXiv:2302.14233v2 [cs.CL] UPDATED) 

Knowledge Distillation $\approx$ Label Smoothing: Fact or Fallacy? (arXiv:2301.12609v4 [cs.LG] UPDATED)

JASMINE: Arabic GPT Models for Few-Shot Learning. (arXiv:2212.10755v2 [cs.CL] UPDATED) 

Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval. (arXiv:2212.10526v3 [cs.CL] UPDATED) 

Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks. (arXiv:2212.09912v2 [cs.CL] UPDATED) 
