Show newer

Predicting generalization performance with correctness discriminators. (arXiv:2311.09422v1 [cs.CL]) 

When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour. (arXiv:2311.09410v1 [cs.CL]) 

Alternatives to the Scaled Dot Product for Attention in the Transformer Neural Network Architecture. (arXiv:2311.09406v1 [cs.LG]) 

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages. (arXiv:2311.09404v1 [cs.CL]) 

LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems. (arXiv:2311.09390v1 [cs.CL]) 

Neural machine translation for automated feedback on children's early-stage writing. (arXiv:2311.09389v1 [cs.CL]) 

Banach-Tarski Embeddings and Transformers. (arXiv:2311.09387v1 [cs.LG]) 

Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach. (arXiv:2311.09383v1 [cs.CL]) 

A Survey on Online User Aggression: Content Detection and Behavioural Analysis on Social Media Platforms. (arXiv:2311.09367v1 [cs.CL]) 

LOKE: Linked Open Knowledge Extraction for Automated Knowledge Graph Construction. (arXiv:2311.09366v1 [cs.CL]) 

Investigating the Emergent Audio Classification Ability of ASR Foundation Models. (arXiv:2311.09363v1 [cs.CL]) 

Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science. (arXiv:2311.09358v1 [cs.CL]) 

LePaRD: A Large-Scale Dataset of Judges Citing Precedents. (arXiv:2311.09356v1 [cs.CL]) 

Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization. (arXiv:2311.09344v1 [cs.CL]) 

Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback. (arXiv:2311.09336v1 [cs.CL]) 

Lighter, yet More Faithful: Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization. (arXiv:2311.09335v1 [cs.CL]) 

Improving fit to human reading times via temperature-scaled surprisal. (arXiv:2311.09325v1 [cs.CL]) 

Spoken Word2Vec: A Perspective And Some Techniques. (arXiv:2311.09319v1 [cs.CL]) 

Divergences between Language Models and Human Brains. (arXiv:2311.09308v1 [cs.CL]) 

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models. (arXiv:2311.09278v1 [cs.CL]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.