Show newer

XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models. (arXiv:2301.10472v1 [cs.CL]) 

Improved Stock Price Movement Classification Using News Articles Based on Embeddings and Label Smoothing. (arXiv:2301.10458v1 [cs.LG]) 

Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection. (arXiv:2301.10451v1 [cs.CL]) 

Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute. (arXiv:2301.10448v1 [cs.CL]) 

An Experimental Study on Pretraining Transformers from Scratch for IR. (arXiv:2301.10444v1 [cs.IR]) 

ViDeBERTa: A powerful pre-trained language model for Vietnamese. (arXiv:2301.10439v1 [cs.CL]) 

Is This Abstract Generated by AI? A Research for the Gap between AI-generated Scientific Text and Human-written Scientific Text. (arXiv:2301.10416v1 [cs.CL]) 

BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing. (arXiv:2301.10412v1 [cs.CL]) 

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER. (arXiv:2301.10410v1 [cs.CL]) 

Editing Language Model-based Knowledge Graph Embeddings. (arXiv:2301.10405v1 [cs.CL]) 

XNLI: Explaining and Diagnosing NLI-based Visual Data Analysis. (arXiv:2301.10385v1 [cs.HC]) 

Weakly Supervised Headline Dependency Parsing. (arXiv:2301.10371v1 [cs.CL]) 

Language Model Detoxification in Dialogue with Contextualized Stance Control. (arXiv:2301.10368v1 [cs.CL]) 

Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction. (arXiv:2301.10309v1 [cs.LG]) 

Large language models can segment narrative events similarly to humans. (arXiv:2301.10297v1 [cs.CL]) 

Audience-Centric Natural Language Generation via Style Infusion. (arXiv:2301.10283v1 [cs.CL]) 

Language Agnostic Data-Driven Inverse Text Normalization. (arXiv:2301.08506v2 [cs.CL] UPDATED) 

Grammar construction methods for extended deterministic expressions. (arXiv:2301.01621v2 [cs.CL] UPDATED) 

Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRs. (arXiv:2212.08072v2 [cs.CL] UPDATED) 

Reasoning over Different Types of Knowledge Graphs: Static, Temporal and Multi-Modal. (arXiv:2212.05767v5 [cs.AI] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.