1607.04606
Enriching Word Vectors with Subword Information
15 July 2016
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
Papers citing
"Enriching Word Vectors with Subword Information"
50 / 2,679 papers shown
Title
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Enhong Chen
3DV
164
7
0
11 Mar 2025
SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc
Daniel Guzman-Olivares
Lara Quijano-Sanchez
Federico Liberatore
65
0
0
07 Mar 2025
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Ling Team
B. Zeng
Chenyu Huang
Chao Zhang
Changxin Tian
...
Zhaoxin Huan
Zujie Wen
Zhenhang Sun
Zhuoxuan Du
Z. He
MoE
ALM
198
5
0
07 Mar 2025
Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
134
1
0
06 Mar 2025
Measuring Intrinsic Dimension of Token Embeddings
Takuya Kataiwa
Cho Hakaze
Tetsushi Ohki
85
0
0
04 Mar 2025
AC-Lite : A Lightweight Image Captioning Model for Low-Resource Assamese Language
Pankaj Choudhury
Yogesh Aggarwal
Prabhanjan Jadhav
Prithwijit Guha
Sukumar Nandi
210
0
0
03 Mar 2025
ConfuGuard: Using Metadata to Detect Active and Stealthy Package Confusion Attacks Accurately and at Scale
Wenxin Jiang
Berk Çakar
Mikola Lysenko
James C. Davis
107
0
0
27 Feb 2025
Cross-Modality Investigation on WESAD Stress Classification
Eric Oliver
Sagnik Dakshit
69
0
0
26 Feb 2025
An Improved Deep Learning Model for Word Embeddings Based Clustering for Large Text Datasets
Vijay Kumar Sutrakar
Nikhil Mogre
77
0
0
22 Feb 2025
Poisoned Source Code Detection in Code Models
Ehab Ghannoum
Mohammad Ghafari
AAML
105
0
0
19 Feb 2025
LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning
Tianshi Zheng
Jiayang Cheng
Chunyang Li
Haochen Shi
Ziyi Wang
Jiaxin Bai
Yangqiu Song
Ginny Wong
Simon See
LRM
161
8
0
16 Feb 2025
Man Made Language Models? Evaluating LLMs' Perpetuation of Masculine Generics Bias
Enzo Doyen
Amalia Todirascu
94
1
0
14 Feb 2025
Sorting the Babble in Babel: Assessing the Performance of Language Detection Algorithms on the OpenAlex Database
Maxime Holmberg Sainte-Marie
Diego Kozlowski
Lucía Céspedes
Vincent Larivière
147
0
0
05 Feb 2025
RiskHarvester: A Risk-based Tool to Prioritize Secret Removal Efforts in Software Artifacts
S. Basak
Tanmay Pardeshi
Bradley Reaves
Laurie A. Williams
53
0
0
03 Feb 2025
DepressionX: Knowledge Infused Residual Attention for Explainable Depression Severity Assessment
Yusif Ibrahimov
Tarique Anwar
Tommy Yuan
184
0
0
28 Jan 2025
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Nicolas Boizard
Kevin El Haddad
Céline Hudelot
Pierre Colombo
167
19
0
28 Jan 2025
Deep Learning and Natural Language Processing in the Field of Construction
Rémy Kessler
Nicolas Béchet
123
0
0
14 Jan 2025
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)
Subba Reddy Oota
Zijiao Chen
Manish Gupta
R. Bapi
G. Jobard
F. Alexandre
X. Hinaut
3DV
AI4CE
152
15
0
31 Dec 2024
A Survey on Online User Aggression: Content Detection and Behavioral Analysis on Social Media
Swapnil S. Mane
Suman Kundu
Rajesh Sharma
136
0
0
31 Dec 2024
Comparative Analysis of Document-Level Embedding Methods for Similarity Scoring on Shakespeare Sonnets and Taylor Swift Lyrics
Klara Kramer
44
0
0
23 Dec 2024
Domain adapted machine translation: What does catastrophic forgetting forget and why?
Danielle Saunders
Steve DeNeefe
AI4CE
51
1
0
23 Dec 2024
HyperCLIP: Adapting Vision-Language models with Hypernetworks
Victor Akinwande
Mohammad Sadegh Norouzzadeh
Devin Willmott
Anna Bair
Madan Ravi Ganesh
J. Zico Kolter
CLIP
VLM
159
0
0
21 Dec 2024
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time
Alireza Pourali
Arian Boukani
Hamzeh Khazaei
115
0
0
20 Dec 2024
Aria-UI: Visual Grounding for GUI Instructions
Yuhao Yang
Yue Wang
Dongxu Li
Ziyang Luo
Bei Chen
Chenyu Huang
Junnan Li
LM&Ro
LLMAG
178
33
0
20 Dec 2024
Is Peer-Reviewing Worth the Effort?
Kenneth Ward Church
Raman Chandrasekar
John E. Ortega
Ibrahim Said Ahmad
OOD
116
3
0
18 Dec 2024
On Enhancing Root Cause Analysis with SQL Summaries for Failures in Database Workload Replays at SAP HANA
Neetha Jambigi
Joshua Hammesfahr
Moritz Mueller
Thomas Bach
Michael Felderer
96
0
0
18 Dec 2024
LLMs are Also Effective Embedding Models: An In-depth Overview
Chongyang Tao
Tao Shen
Shen Gao
Junshuo Zhang
Zhen Li
Zhengwei Tao
Shuai Ma
143
11
0
17 Dec 2024
Knowledge Migration Framework for Smart Contract Vulnerability Detection
Luqi Wang
Wenbao Jiang
133
0
0
15 Dec 2024
Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection
Ahmed Haj Ahmed
Rui-Jie Yew
Xerxes Minocher
Suresh Venkatasubramanian
102
0
0
14 Dec 2024
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
Lefteris Loukas
Nikolaos Smyrnioudis
Chrysa Dikonomaki
Spyros Barbakos
Anastasios Toumazatos
...
Manolis Kyriakakis
Mary Georgiou
Stavros Vassos
John Pavlopoulos
Ion Androutsopoulos
AILaw
151
0
0
11 Dec 2024
From communities to interpretable network and word embedding: an unified approach
Thibault Prouteau
Nicolas Dugué
Simon Guillot
GNN
109
1
0
11 Dec 2024
Bilingual BSARD: Extending Statutory Article Retrieval to Dutch
Ehsan Lotfi
Nikolay Banar
Nerses Yuzbashyan
Walter Daelemans
AILaw
105
1
0
10 Dec 2024
ORIS: Online Active Learning Using Reinforcement Learning-based Inclusive Sampling for Robust Streaming Analytics System
Rahul Pandey
Ziwei Zhu
Hemant Purohit
101
0
0
27 Nov 2024
FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web
Cheng-Wei Lin
Wan-Hsuan Hsieh
Kai-Xin Guan
Chan-Jan Hsu
Chia-Chen Kuo
Chuan-Lin Lai
Chung-Wei Chung
Ming-Jen Wang
Da-shan Shiu
82
1
0
25 Nov 2024
Writing Style Matters: An Examination of Bias and Fairness in Information Retrieval Systems
Hongliu Cao
134
4
0
20 Nov 2024
Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions
Robin Carpentier
B. Zhao
Hassan Jameel Asghar
Dali Kaafar
144
1
0
18 Nov 2024
HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings
Anton M. Alekseev
Gulnara Kabaeva
28
0
0
16 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa
ALM
135
2
0
14 Nov 2024
A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information
Prashant Kapil
Asif Ekbal
81
0
0
11 Nov 2024
Investigating Idiomaticity in Word Representations
Wei He
Tiago Kramer Vieira
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
99
1
0
04 Nov 2024
Zipfian Whitening
Sho Yokoi
Han Bao
Hiroto Kurita
Hidetoshi Shimodaira
64
0
0
01 Nov 2024
MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees
Ryan Zhang
Herbert Woisetschläger
Shiqiang Wang
Hans-Arno Jacobsen
36
0
0
31 Oct 2024
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages
Amir Hossein Kargaran
François Yvon
Hinrich Schütze
VLM
129
8
0
31 Oct 2024
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers
Lam Nguyen Tung
Steven Cho
Xiaoning Du
Neelofar Neelofar
Valerio Terragni
Stefano Ruberto
Aldeida Aleti
549
2
0
30 Oct 2024
RELATE: A Modern Processing Platform for Romanian Language
V. Pais
Radu Ion
Andrei-Marius Avram
Maria Mitrofan
D. Tufis
VLM
38
0
0
29 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
93
10
0
28 Oct 2024
Measuring individual semantic networks: A simulation study
Samuel Aeschbach
Rui Mata
Dirk U. Wulff
18
0
0
23 Oct 2024
MojoBench: Language Modeling and Benchmarks for Mojo
Nishat Raihan
Joanna C. S. Santos
Marcos Zampieri
86
2
0
23 Oct 2024
ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Elyas Obbad
Iddah Mlauzi
Alycia Lee
Rylan Schaeffer
Kamal Obbad
Suhana Bedi
Sanmi Koyejo
CVBM
146
0
0
23 Oct 2024
LightFusionRec: Lightweight Transformers-Based Cross-Domain Recommendation Model
Vansh Kharidia
Dhruvi Paprunia
Prashasti Kanikar
16
0
0
21 Oct 2024