Combining Transfer Learning with In-context Learning using Blackbox LLMs for Zero-shot Knowledge Base Question Answering. (arXiv:2311.08894v1 [cs.CL])
Large Language Models are legal but they are not: Making the case for a powerful LegalLLM. (arXiv:2311.08890v1 [cs.CL])
CLIMB: Curriculum Learning for Infant-inspired Model Building. (arXiv:2311.08886v1 [cs.CL])
Enabling Large Language Models to Learn from Rules. (arXiv:2311.08883v1 [cs.CL])
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation. (arXiv:2311.08877v1 [cs.CL])
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining. (arXiv:2311.08849v1 [cs.CL])
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder. (arXiv:2311.08844v1 [cs.CV])
Disinformation Capabilities of Large Language Models. (arXiv:2311.08838v1 [cs.CL])
Evaluating Gender Bias in the Translation of Gender-Neutral Languages into English. (arXiv:2311.08836v1 [cs.CL])
MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy. (arXiv:2311.08817v1 [cs.CL])
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving. (arXiv:2311.08803v1 [cs.CL])
German FinBERT: A German Pre-trained Language Model. (arXiv:2311.08793v1 [cs.CL])
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects. (arXiv:2311.08788v1 [cs.CL])
Three Conjectures on Unexpectedeness. (arXiv:2311.08768v1 [cs.AI])
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity. (arXiv:2311.08756v1 [cs.CL])
Thread of Thought Unraveling Chaotic Contexts. (arXiv:2311.08734v1 [cs.CL])
Enhancing Emergency Decision-making with Knowledge Graphs and Large Language Models. (arXiv:2311.08732v1 [cs.CL])
Uncertainty Estimation on Sequential Labeling via Uncertainty Transmission. (arXiv:2311.08726v1 [cs.CL])
Method for Text Entity Linking in Power Distribution Scheduling Oriented to Power Distribution Network Knowledge Graph. (arXiv:2311.08724v1 [cs.CL])
Token Prediction as Implicit Classification to Identify LLM-Generated Text. (arXiv:2311.08723v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl