Show newer

Proceedings of the 3rd International Workshop on Mining and Learning in the Legal Domain (MLLD-23). (arXiv:2311.10733v1 [cs.CY]) 

Chatbot-supported Thesis Writing: An Autoethnographic Report. (arXiv:2311.10729v1 [cs.CY]) 

Large Language Models in Finance: A Survey. (arXiv:2311.10723v1 [q-fin.GN]) 

PsyBench: a balanced and in-depth Psychological Chinese Evaluation Benchmark for Foundation Models. (arXiv:2311.09861v2 [cs.CL] UPDATED) 

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering. (arXiv:2311.06668v2 [cs.LG] UPDATED) 

Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models. (arXiv:2311.06233v2 [cs.CL] UPDATED) 

Uncovering Intermediate Variables in Transformers using Circuit Probing. (arXiv:2311.04354v2 [cs.CL] UPDATED) 

DUMA: a Dual-Mind Conversational Agent with Fast and Slow Thinking. (arXiv:2310.18075v3 [cs.CL] UPDATED) 

CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion. (arXiv:2310.11248v2 [cs.LG] UPDATED) 

Targeted Image Data Augmentation Increases Basic Skills Captioning Robustness. (arXiv:2309.15991v2 [cs.CV] UPDATED) 

Insights Into the Nutritional Prevention of Macular Degeneration based on a Comparative Topic Modeling Approach. (arXiv:2309.00312v4 [cs.CL] UPDATED) 

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use. (arXiv:2308.06595v3 [cs.CL] UPDATED) 

Who Wrote this Code? Watermarking for Code Generation. (arXiv:2305.15060v2 [cs.CL] UPDATED) 

A Fair and In-Depth Evaluation of Existing End-to-End Entity Linking Systems. (arXiv:2305.14937v2 [cs.CL] UPDATED) 

InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction. (arXiv:2305.14659v2 [cs.CL] UPDATED) 

Hierarchical Catalogue Generation for Literature Review: A Benchmark. (arXiv:2304.03512v3 [cs.CL] UPDATED) 

GPT-4 can pass the Korean National Licensing Examination for Korean Medicine Doctors. (arXiv:2303.17807v2 [cs.CL] UPDATED) 

Language Models can Solve Computer Tasks. (arXiv:2303.17491v3 [cs.CL] UPDATED) 

Classifying COVID-19 vaccine narratives. (arXiv:2207.08522v2 [cs.CL] UPDATED) 

Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search. (arXiv:2203.08436v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.