Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.09359
Cited By
Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation
20 September 2020
Tahmid Hasan
Abhik Bhattacharjee
Kazi Samin Mubasshir
Masum Hasan
Madhusudan Basak
M. Rahman
Rifat Shahriyar
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation"
42 / 42 papers shown
Title
TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking
Shahriar Kabir Nahin
R. N. Nandi
Sagor Sarker
Quazi Sarwar Muhtaseem
Md. Kowsher
Apu Chandraw Shill
Md Ibrahim
Mehadi Hasan Menon
Tareq Al Muntasir
Firoj Alam
68
0
0
24 Feb 2025
BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion
Md Osama
Ashim Dey
Kawsar Ahmed
Muhammad Ashad Kabir
53
0
0
03 Jan 2025
Language verY Rare for All
Ibrahim Merad
Amos Wolf
Ziad Mazzawi
Yannick Léo
72
0
0
18 Dec 2024
Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large Language Models
Jayanta Sadhu
Ayan Antik Khan
Noshin Nawal
Sanju Basak
Abhik Bhattacharjee
Rifat Shahriyar
71
0
0
24 Nov 2024
BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
Muhammad Rafsan Kabir
Md. Mohibur Rahman Nabil
Mohammad Ashrafuzzaman Khan
64
0
0
22 Nov 2024
The Zeno's Paradox of `Low-Resource' Languages
H. Nigatu
A. Tonja
Benjamin Rosman
Thamar Solorio
Monojit Choudhury
135
5
0
28 Oct 2024
Better to Ask in English: Evaluation of Large Language Models on English, Low-resource and Cross-Lingual Settings
Krishno Dey
Prerona Tarannum
Md. Arid Hasan
Imran Razzak
Usman Naseem
35
3
0
17 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language
Aunabil Chakma
Aditya Chakma
Soham Khisa
Chumui Tripura
Masum Hasan
Rifat Shahriyar
16
0
0
14 Oct 2024
Table Question Answering for Low-resourced Indic Languages
Vaishali Pal
Evangelos Kanoulas
Andrew Yates
Maarten de Rijke
LMTD
31
0
0
04 Oct 2024
A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations
Nidhi Kowtal
Tejas Deshpande
Raviraj Joshi
15
1
0
04 Sep 2024
An Empirical Study of Gendered Stereotypes in Emotional Attributes for Bangla in Multilingual Large Language Models
Jayanta Sadhu
Maneesha Rani Saha
Rifat Shahriyar
34
0
0
08 Jul 2024
Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz
Satak Kumar Dey
Ruwad Naswan
Hasnaen Adil
Khondker Salman Sayeed
Haz Sameen Shahgir
33
0
0
29 Jun 2024
An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla
Jayanta Sadhu
Ayan Antik Khan
Abhik Bhattacharjee
Rifat Shahriyar
30
2
0
25 Jun 2024
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
Kamal Kumar
Yinhan Liu
Parth Patwa
Tanmoy
Mihir Adam Roberts
19
1
0
25 Mar 2024
Explainable Multimodal Sentiment Analysis on Bengali Memes
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Sajib Kumar Saha Joy
Faisal Muhammad Shah
25
1
0
20 Dec 2023
Hate Speech and Offensive Content Detection in Indo-Aryan Languages: A Battle of LSTM and Transformers
Nikhil Narayan
Mrutyunjay Biswal
Pramod Goyal
Abhranta Panigrahi
VLM
14
3
0
09 Dec 2023
Vashantor: A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
Fatema Tuj Johora Faria
Mukaffi Bin Moin
Ahmed Al Wase
Mehidi Ahmmed
Md. Rabius Sani
Tashreef Muhammad
22
5
0
18 Nov 2023
BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset
Motahar Mahtab
Monirul Haque
Mehedi Hasan
Farig Sadeque
11
1
0
10 Nov 2023
RSM-NLP at BLP-2023 Task 2: Bangla Sentiment Analysis using Weighted and Majority Voted Fine-Tuned Transformers
Pratinav Seth
Rashi Goel
Komal Mathur
Swetha Vemulapalli
18
1
0
22 Oct 2023
Bengali Fake Reviews: A Benchmark Dataset and Detection System
G. M. Shahariar
Rouf Shawon
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
25
4
0
03 Aug 2023
Model Adaptation for ASR in low-resource Indian Languages
Abhayjeet Singh
Arjun Singh Mehta
S. AshishKhuraishiK.
G. Deekshitha
Gauri Date
...
Priyanka Pai
Raoul Nanavati
Rohan Saxena
Sai Praneeth Reddy Mora
Srinivasa Raghavan
19
9
0
16 Jul 2023
Tackling Fake News in Bengali: Unraveling the Impact of Summarization vs. Augmentation on Pre-trained Language Models
Arman Sakif Chowdhury
G. M. Shahariar
Ahammed Tarik Aziz
Syed Mohibul Alam
Md. Azad Sheikh
Tanveer Ahmed Belal
11
0
0
13 Jul 2023
Vacaspati: A Diverse Corpus of Bangla Literature
Pramit Bhattacharyya
Joydeep Mondal
S. Maji
Arnab Bhattacharya
37
5
0
11 Jul 2023
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Md Mahfuz Ibn Alam
Sina Ahmadi
Antonios Anastasopoulos
23
6
0
26 May 2023
On Evaluation of Bangla Word Analogies
Mousumi Akter
Souvik Sarkar
S. Karmaker
14
3
0
10 Apr 2023
Bengali Fake Review Detection using Semi-supervised Generative Adversarial Networks
Md. Tanvir Rouf Shawon
G. M. Shahariar
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
20
4
0
05 Apr 2023
BanglaCoNER: Towards Robust Bangla Complex Named Entity Recognition
Haz Sameen Shahgir
Ramisa Alam
Md Zarif Ul Alam
9
2
0
16 Mar 2023
JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset
Ruth-Ann Armstrong
John Hewitt
Christopher D. Manning
30
14
0
07 Dec 2022
Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages
Idris Abdulmumin
Michael Beukman
Jesujoba Oluwadara Alabi
Chris C. Emezue
Everlyn Asiko
...
Shamsuddeen Hassan Muhammad
Mofetoluwa Adeyemi
Oreen Yousuf
Sahib Singh
T. Gwadabe
34
6
0
19 Oct 2022
BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset
Ajwad Akil
Najrin Sultana
Abhik Bhattacharjee
Rifat Shahriyar
32
15
0
11 Oct 2022
BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
16
6
0
28 May 2022
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Rifat Shahriyar
AIMat
LM&MA
39
28
0
23 May 2022
TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla
Nazia Tasnim
Md. Istiak Hossain Shihab
Asif Sushmit
Steven Bethard
Farig Sadeque
24
1
0
21 Apr 2022
Recent Advances in Neural Text Generation: A Task-Agnostic Survey
Chen Tang
Frank Guerin
Chenghua Lin
AI4CE
OOD
28
19
0
06 Mar 2022
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
31
148
0
01 Sep 2021
A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models
Firoj Alam
Md. Arid Hasan
Tanvirul Alam
A. Khan
Janntatul Tajrin
Naira Khan
Shammur A. Chowdhury
LM&MA
22
23
0
08 Jul 2021
Don't Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data
Rajat Bhatnagar
Ananya Ganesh
Katharina Kann
13
2
0
12 Jun 2021
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Gowtham Ramesh
Sumanth Doddapaneni
Aravinth Bheemaraj
Mayank Jobanputra
AK Raghavan
...
K. Deepak
Vivek Raghavan
Anoop Kunchukuttan
Pratyush Kumar
Mitesh Khapra
LRM
37
229
0
12 Apr 2021
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Kazi Samin Mubasshir
Md. Saiful Islam
Anindya Iqbal
M. Rahman
Rifat Shahriyar
SSL
VLM
25
166
0
01 Jan 2021
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAML
AIMat
224
1,208
0
12 Jun 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
262
1,896
0
10 Jan 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
1