Show newer

Rethinking Benchmark and Contamination for Language Models with Rephrased Samples. (arXiv:2311.04850v1 [cs.CL]) 

Hierarchically Gated Recurrent Neural Network for Sequence Modeling. (arXiv:2311.04823v1 [cs.CL]) 

MTGER: Multi-view Temporal Graph Enhanced Temporal Reasoning over Time-Involved Document. (arXiv:2311.04816v1 [cs.CL]) 

DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining. (arXiv:2311.04799v1 [cs.CL]) 

Determination of toxic comments and unintended model bias minimization using Deep learning approach. (arXiv:2311.04789v1 [cs.LG]) 

Using large language models to study human memory for meaningful narratives. (arXiv:2311.04742v1 [cs.CL]) 

Evaluating Generative Ad Hoc Information Retrieval. (arXiv:2311.04694v1 [cs.IR]) 

Pre-training LLMs using human-like development data corpus. (arXiv:2311.04666v1 [cs.CL]) 

Speech language models lack important brain-relevant semantics. (arXiv:2311.04664v1 [cs.CL]) 

Massive Editing for Large Language Models via Meta Learning. (arXiv:2311.04661v1 [cs.CL]) 

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models. (arXiv:2311.04589v1 [cs.CL]) 

Investigating the Nature of Disagreements on Mid-Scale Ratings: A Case Study on the Abstractness-Concreteness Continuum. (arXiv:2311.04563v1 [cs.CL]) 

Assessing Distractors in Multiple-Choice Tests. (arXiv:2311.04554v1 [cs.CL]) 

Large GPT-like Models are Bad Babies: A Closer Look at the Relationship between Linguistic Competence and Psycholinguistic Measures. (arXiv:2311.04547v1 [cs.CL]) 

RankAug: Augmented data ranking for text classification. (arXiv:2311.04535v1 [cs.CL]) 

Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR. (arXiv:2311.04534v1 [cs.CL]) 

Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction. (arXiv:2311.04507v1 [cs.CL]) 

NExT-Chat: An LMM for Chat, Detection and Segmentation. (arXiv:2311.04498v1 [cs.CV]) 

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection. (arXiv:2311.04495v1 [cs.CL]) 

CLearViD: Curriculum Learning for Video Description. (arXiv:2311.04480v1 [cs.CV]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.