v1v2 (latest)

Enriching Word Vectors with Subword Information

15 July 2016

Papers citing "Enriching Word Vectors with Subword Information"

50 / 2,679 papers shown

Title
A Survey on Knowledge-Oriented Retrieval-Augmented Generation Mingyue Cheng Yucong Luo Jie Ouyang Qiang Liu Huijie Liu ... Bohou Zhang Jiawei Cao Jie Ma Daoyu Wang Enhong Chen 3DV 164 7 0 11 Mar 2025
SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc Daniel Guzman-Olivares Lara Quijano-Sanchez Federico Liberatore 65 0 0 07 Mar 2025
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs Ling Team B. Zeng Chenyu Huang Chao Zhang Changxin Tian ... Zhaoxin Huan Zujie Wen Zhenhang Sun Zhuoxuan Du Z. He MoE ALM 198 5 0 07 Mar 2025
Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation A. Zebaze Benoît Sagot Rachel Bawden 134 1 0 06 Mar 2025
Measuring Intrinsic Dimension of Token Embeddings Takuya Kataiwa Cho Hakaze Tetsushi Ohki 85 0 0 04 Mar 2025
AC-Lite : A Lightweight Image Captioning Model for Low-Resource Assamese Language Pankaj Choudhury Yogesh Aggarwal Prabhanjan Jadhav Prithwijit Guha Sukumar Nandi 210 0 0 03 Mar 2025
ConfuGuard: Using Metadata to Detect Active and Stealthy Package Confusion Attacks Accurately and at Scale Wenxin Jiang Berk Çakar Mikola Lysenko James C. Davis 107 0 0 27 Feb 2025
Cross-Modality Investigation on WESAD Stress Classification Eric Oliver Sagnik Dakshit 69 0 0 26 Feb 2025
An Improved Deep Learning Model for Word Embeddings Based Clustering for Large Text Datasets Vijay Kumar Sutrakar Nikhil Mogre 77 0 0 22 Feb 2025
Poisoned Source Code Detection in Code Models Ehab Ghannoum Mohammad Ghafari AAML 105 0 0 19 Feb 2025
LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning Tianshi Zheng Jiayang Cheng Chunyang Li Haochen Shi Ziyi Wang Jiaxin Bai Yangqiu Song Ginny Wong Simon See LRM 161 8 0 16 Feb 2025
Man Made Language Models? Evaluating LLMs' Perpetuation of Masculine Generics Bias Enzo Doyen Amalia Todirascu 94 1 0 14 Feb 2025
Sorting the Babble in Babel: Assessing the Performance of Language Detection Algorithms on the OpenAlex Database Maxime Holmberg Sainte-Marie Diego Kozlowski Lucía Céspedes Vincent Larivière 147 0 0 05 Feb 2025
RiskHarvester: A Risk-based Tool to Prioritize Secret Removal Efforts in Software Artifacts S. Basak Tanmay Pardeshi Bradley Reaves Laurie A. Williams 53 0 0 03 Feb 2025
DepressionX: Knowledge Infused Residual Attention for Explainable Depression Severity Assessment Yusif Ibrahimov Tarique Anwar Tommy Yuan 184 0 0 28 Jan 2025
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Nicolas Boizard Kevin El Haddad C´eline Hudelot Pierre Colombo 167 19 0 28 Jan 2025
Deep Learning and Natural Language Processing in the Field of Construction Rémy Kessler Nicolas Béchet 123 0 0 14 Jan 2025
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey) Subba Reddy Oota Zijiao Chen Manish Gupta R. Bapi G. Jobard F. Alexandre X. Hinaut 3DV AI4CE 152 15 0 31 Dec 2024
A Survey on Online User Aggression: Content Detection and Behavioral Analysis on Social Media Swapnil S. Mane Suman Kundu Rajesh Sharma 136 0 0 31 Dec 2024
Comparative Analysis of Document-Level Embedding Methods for Similarity Scoring on Shakespeare Sonnets and Taylor Swift Lyrics Klara Kramer 44 0 0 23 Dec 2024
Domain adapted machine translation: What does catastrophic forgetting forget and why? Danielle Saunders Steve DeNeefe AI4CE 51 1 0 23 Dec 2024
HyperCLIP: Adapting Vision-Language models with Hypernetworks Victor Akinwande Mohammad Sadegh Norouzzadeh Devin Willmott Anna Bair Madan Ravi Ganesh J. Zico Kolter CLIP VLM 159 0 0 21 Dec 2024
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time Alireza Pourali Arian Boukani Hamzeh Khazaei 115 0 0 20 Dec 2024
Aria-UI: Visual Grounding for GUI Instructions Yuhao Yang Yue Wang Dongxu Li Ziyang Luo Bei Chen Chenyu Huang Junnan Li LM&Ro LLMAG 178 33 0 20 Dec 2024
Is Peer-Reviewing Worth the Effort? Kenneth Ward Church Raman Chandrasekar John E. Ortega Ibrahim Said Ahmad OOD 116 3 0 18 Dec 2024
On Enhancing Root Cause Analysis with SQL Summaries for Failures in Database Workload Replays at SAP HANA Neetha Jambigi Joshua Hammesfahr Moritz Mueller Thomas Bach Michael Felderer 96 0 0 18 Dec 2024
LLMs are Also Effective Embedding Models: An In-depth Overview Chongyang Tao Tao Shen Shen Gao Junshuo Zhang Zhen Li Zhengwei Tao Shuai Ma 143 11 0 17 Dec 2024
Knowledge Migration Framework for Smart Contract Vulnerability Detection Luqi Wang Wenbao Jiang 133 0 0 15 Dec 2024
Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection Ahmed Haj Ahmed Rui-Jie Yew Xerxes Minocher Suresh Venkatasubramanian 102 0 0 14 Dec 2024
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek Lefteris Loukas Nikolaos Smyrnioudis Chrysa Dikonomaki Spyros Barbakos Anastasios Toumazatos ... Manolis Kyriakakis Mary Georgiou Stavros Vassos John Pavlopoulos Ion Androutsopoulos AILaw 151 0 0 11 Dec 2024
From communities to interpretable network and word embedding: an unified approach Thibault Prouteau Nicolas Dugué Simon Guillot GNN 109 1 0 11 Dec 2024
Bilingual BSARD: Extending Statutory Article Retrieval to Dutch Ehsan Lotfi Nikolay Banar Nerses Yuzbashyan Walter Daelemans AILaw 105 1 0 10 Dec 2024
ORIS: Online Active Learning Using Reinforcement Learning-based Inclusive Sampling for Robust Streaming Analytics System Rahul Pandey Ziwei Zhu Hemant Purohit 101 0 0 27 Nov 2024
FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web Cheng-Wei Lin Wan-Hsuan Hsieh Kai-Xin Guan Chan-Jan Hsu Chia-Chen Kuo Chuan-Lin Lai Chung-Wei Chung Ming-Jen Wang Da-shan Shiu 82 1 0 25 Nov 2024
Writing Style Matters: An Examination of Bias and Fairness in Information Retrieval Systems Hongliu Cao 134 4 0 20 Nov 2024
Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions Robin Carpentier B. Zhao Hassan Jameel Asghar Dali Kaafar 144 1 0 18 Nov 2024
HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings Anton M. Alekseev Gulnara Kabaeva 28 0 0 16 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data Márton Szép Daniel Rueckert Rüdiger von Eisenhart-Rothe Florian Hinterwimmer SyDa ALM 135 2 0 14 Nov 2024
A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information Prashant Kapil Asif Ekbal 81 0 0 11 Nov 2024
Investigating Idiomaticity in Word Representations Wei He Tiago Kramer Vieira Marcos García Carolina Scarton M. Idiart Aline Villavicencio 99 1 0 04 Nov 2024
Zipfian Whitening Sho Yokoi Han Bao Hiroto Kurita Hidetoshi Shimodaira 64 0 0 01 Nov 2024
MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees Ryan Zhang Herbert Woisetschläger Shiqiang Wang Hans-Arno Jacobsen 36 0 0 31 Oct 2024
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages Amir Hossein Kargaran François Yvon Hinrich Schutze VLM 129 8 0 31 Oct 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Lam Nguyen Tung Steven Cho Xiaoning Du Neelofar Neelofar Valerio Terragni Stefano Ruberto Aldeida Aleti 549 2 0 30 Oct 2024
RELATE: A Modern Processing Platform for Romanian Language V. Pais Radu Ion Andrei-Marius Avram Maria Mitrofan D. Tufis VLM 38 0 0 29 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning Xun Guo Shan Zhang Yongxin He Ting Zhang Wanquan Feng Haibin Huang Chongyang Ma DeLMO 93 10 0 28 Oct 2024
Measuring individual semantic networks: A simulation study Samuel Aeschbach Rui Mata Dirk U. Wulff 18 0 0 23 Oct 2024
MojoBench: Language Modeling and Benchmarks for Mojo Nishat Raihan Joanna C. S. Santos Marcos Zampieri 86 2 0 23 Oct 2024
ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment Elyas Obbad Iddah Mlauzi Alycia Lee Rylan Schaeffer Kamal Obbad Suhana Bedi Sanmi Koyejo CVBM 146 0 0 23 Oct 2024
LightFusionRec: Lightweight Transformers-Based Cross-Domain Recommendation Model Vansh Kharidia Dhruvi Paprunia Prashasti Kanikar 16 0 0 21 Oct 2024