ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.05100
  4. Cited By
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

9 November 2022
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
Ellie Pavlick
Suzana Ilić
Daniel Hesslow
Roman Castagné
A. Luccioni
François Yvon
Matthias Gallé
J. Tow
Alexander M. Rush
Stella Biderman
Albert Webson
Pawan Sasanka Ammanamanchi
Thomas Wang
Benoît Sagot
Niklas Muennighoff
Albert Villanova del Moral
Olatunji Ruwase
Rachel Bawden
Stas Bekman
Angelina McMillan-Major
Iz Beltagy
Huu Nguyen
Lucile Saulnier
Samson Tan
Pedro Ortiz Suarez
Victor Sanh
Hugo Laurenccon
Yacine Jernite
Julien Launay
Margaret Mitchell
Colin Raffel
Aaron Gokaslan
Adi Simhi
Aitor Soroa Etxabe
Alham Fikri Aji
Amit Alfassy
Anna Rogers
Ariel Kreisberg Nitzav
Canwen Xu
Chenghao Mou
Chris C. Emezue
Christopher Klamm
Colin Leong
Daniel Alexander van Strien
David Ifeoluwa Adelani
Dragomir R. Radev
E. G. Ponferrada
Efrat Levkovizh
Ethan Kim
Eyal Natan
F. Toni
Gérard Dupont
Germán Kruszewski
Giada Pistilli
Hady ElSahar
Hamza Benyamina
H. Tran
Ian Yu
Idris Abdulmumin
Isaac Johnson
Itziar Gonzalez-Dios
Javier de la Rosa
Jenny Chim
Jesse Dodge
Jian Zhu
Jonathan Chang
Jorg Frohberg
Josephine Tobing
J. Bhattacharjee
Khalid Almubarak
Kimbo Chen
Kyle Lo
Leandro von Werra
Leon Weber
Long Phan
Loubna Ben Allal
Ludovic Tanguy
Manan Dey
M. Muñoz
Maraim Masoud
María Grandury
Mario vSavsko
Max Huang
Maximin Coavoux
Mayank Singh
Mike Tian-Jian Jiang
Minh Chien Vu
M. A. Jauhar
Mustafa Ghaleb
Nishant Subramani
Nora Kassner
Nurulaqilla Khamis
Olivier Nguyen
Omar Espejel
Ona de Gibert
Paulo Villegas
Peter Henderson
Pierre Colombo
Priscilla Amuok
Quentin Lhoest
Rheza Harliman
Rishi Bommasani
R. López
Rui Ribeiro
Salomey Osei
S. Pyysalo
Sebastian Nagel
Shamik Bose
Shamsuddeen Hassan Muhammad
Shanya Sharma
Shayne Longpre
Somaieh Nikpoor
S. Silberberg
S. Pai
S. Zink
Tiago Timponi Torrent
Timo Schick
Tristan Thrush
V. Danchev
Vassilina Nikoulina
Veronika Laippala
Violette Lepercq
V. Prabhu
Zaid Alyafeai
Zeerak Talat
Arun Raja
Benjamin Heinzerling
Chenglei Si
Davut Emre Taşar
Elizabeth Salesky
Sabrina J. Mielke
Wilson Y. Lee
Abheesht Sharma
Andrea Santilli
Antoine Chaffin
Arnaud Stiegler
Debajyoti Datta
Eliza Szczechla
Gunjan Chhablani
Han Wang
Harshit Pandey
Hendrik Strobelt
Jason Alan Fries
Jos Rozen
Leo Gao
Lintang Sutawika
M Saiful Bari
Maged S. Al-Shaibani
Matteo Manica
Nihal V. Nayak
Ryan Teehan
Samuel Albanie
Sheng Shen
Srulik Ben-David
Stephen H. Bach
Taewoon Kim
T. Bers
Thibault Févry
Trishala Neeraj
Urmish Thakker
Vikas Raunak
Xiang Tang
Zheng-Xin Yong
Zhiqing Sun
Shaked Brody
Y. Uri
Hadar Tojarieh
Adam Roberts
Hyung Won Chung
Jaesung Tae
Jason Phang
Ofir Press
Conglong Li
Deepak Narayanan
Hatim Bourfoune
Jared Casper
Jeff Rasley
Max Ryabinin
Mayank Mishra
Minjia Zhang
Mohammad Shoeybi
Myriam Peyrounette
N. Patry
Nouamane Tazi
Omar Sanseviero
Patrick von Platen
Pierre Cornette
Pierre Franccois Lavallée
Rémi Lacroix
Samyam Rajbhandari
Sanchit Gandhi
Shaden Smith
S. Requena
Suraj Patil
Tim Dettmers
Ahmed Baruwa
Amanpreet Singh
Anastasia Cheveleva
Anne-Laure Ligozat
Arjun Subramonian
Aurélie Névéol
Charles Lovering
Daniel H Garrette
D. Tunuguntla
Ehud Reiter
Ekaterina Taktasheva
E. Voloshina
Eli Bogdanov
Genta Indra Winata
Hailey Schoelkopf
Jan-Christoph Kalo
Jekaterina Novikova
Jessica Zosa Forde
Zdenvek Kasner
Jungo Kasai
Ken Kawamura
Liam Hazan
Marine Carpuat
Miruna Clinciu
Najoung Kim
Newton Cheng
O. Serikov
Omer Antverg
Oskar van der Wal
Rui Zhang
Ruochen Zhang
Sebastian Gehrmann
Shachar Mirkin
S. Pais
Tatiana Shavrina
Thomas Scialom
Tian Yun
Tomasz Limisiewicz
Verena Rieser
Vitaly Protasov
Vladislav Mikhailov
Yada Pruksachatkun
Yonatan Belinkov
Zachary Bamberger
Zdeněk Kasner
Xiangru Tang
A. Pestana
A. Feizpour
Ammar Khan
Amy Faranak
A. Santos
Anthony Hevia
Antigona Unldreaj
Arash Aghagol
Arezoo Abdollahi
A. Tammour
A. HajiHosseini
Bahareh Behroozi
Benjamin Ayoade Ajibade
B. Saxena
Carlos Muñoz Ferrandis
Daniel McDuff
Danish Contractor
D. Lansky
Davis David
Douwe Kiela
D. A. Nguyen
Edward Tan
Emi Baylor
Ezinwanne Ozoani
F. Mirza
Frankline Ononiwu
Habib Rezanejad
H.A. Jones
Indrani Bhattacharya
Irene Solaiman
Irina Sedenko
I. Nejadgholi
J. Passmore
Joshua Seltzer
Julio Bonis Sanz
Lívia Dutra
Mairon Samagaio
Maraim Elbadri
Margot Mieskes
Marissa Gerchick
Martha Akinlolu
Michael McKenna
Mike Qiu
M. Ghauri
Mykola Burynok
Nafis Abrar
Nazneen Rajani
Nour Elkott
N. Fahmy
Olanrewaju Samuel
Ran An
R. Kromann
Ryan Hao
S. Alizadeh
Sarmad Shubber
Silas L. Wang
Sourav Roy
S. Viguier
Thanh-Cong Le
Tobi Oyebade
T. Le
Yoyo Yang
Zach Nguyen
Abhinav Ramesh Kashyap
Alfredo Palasciano
A. Callahan
Anima Shukla
Antonio Miranda-Escalada
A. Singh
Benjamin Beilharz
Bo Wang
C. Brito
Chenxi Zhou
Chirag Jain
Chuxin Xu
Clémentine Fourrier
Daniel León Perinán
Daniel Molano
Dian Yu
Enrique Manjavacas
Fabio Barth
Florian Fuhrimann
Gabriel Altay
Giyaseddin Bayrak
Gully Burns
Helena U. Vrabec
I. Bello
Isha Dash
J. Kang
John Giorgi
Jonas Golde
J. Posada
Karthi Sivaraman
Lokesh Bulchandani
Lu Liu
Luisa Shinzato
Madeleine Hahn de Bykhovetz
Maiko Takeuchi
Marc Pàmies
M. A. Castillo
Marianna Nezhurina
Mario Sanger
Matthias Samwald
Michael Cullan
Michael Weinberg
M. Wolf
Mina Mihaljcic
Minna Liu
M. Freidank
Myungsun Kang
Natasha Seelam
N. Dahlberg
N. Broad
N. Muellner
Pascale Fung
Patrick Haller
Patricia Haller
R. Eisenberg
Robert Martin
Rodrigo Canalli
Rosaline Su
Ruisi Su
Samuel Cahyawijaya
Samuele Garda
Shlok S Deshmukh
Shubhanshu Mishra
Sid Kiblawi
Simon Ott
Sinee Sang-aroonsiri
Srishti Kumar
Stefan Schweter
S. Bharati
Tanmay Laud
Théo Gigant
Tomoya Kainuma
Wojciech Kusa
Yanis Labrak
Yashasvi Bajaj
Y. Venkatraman
Yifan Xu
Ying Xu
Yu Xu
Z. Tan
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
    VLM
ArXivPDFHTML

Papers citing "BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"

50 / 1,625 papers shown
Title
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
Harika Abburi
Kalyani Roy
Michael Suesserman
Nirmala Pudota
Balaji Veeramani
Edward Bowen
Sanmitra Bhattacharya
DeLMO
45
10
0
06 Nov 2023
PhoGPT: Generative Pre-training for Vietnamese
PhoGPT: Generative Pre-training for Vietnamese
Dat Quoc Nguyen
L. T. Nguyen
Chi Tran
Dung Ngoc Nguyen
D.Q. Phung
Hung Bui
36
9
0
06 Nov 2023
FaMeSumm: Investigating and Improving Faithfulness of Medical
  Summarization
FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization
Nan Zhang
Yusen Zhang
Wu Guo
P. Mitra
Rui Zhang
HILM
43
4
0
03 Nov 2023
Post Turing: Mapping the landscape of LLM Evaluation
Post Turing: Mapping the landscape of LLM Evaluation
Alexey Tikhonov
Ivan P. Yamshchikov
ELM
66
4
0
03 Nov 2023
The language of prompting: What linguistic properties make a prompt
  successful?
The language of prompting: What linguistic properties make a prompt successful?
Alina Leidinger
R. Rooij
Ekaterina Shutova
46
43
0
03 Nov 2023
Indicative Summarization of Long Discussions
Indicative Summarization of Long Discussions
S. Syed
Dominik Schwabe
Khalid Al Khatib
Martin Potthast
36
1
0
03 Nov 2023
Sentiment Analysis through LLM Negotiations
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Fei Wu
Jiwei Li
Tianwei Zhang
Guoyin Wang
50
16
0
03 Nov 2023
Towards Concept-Aware Large Language Models
Towards Concept-Aware Large Language Models
Chen Shani
Jilles Vreeken
Dafna Shahaf
LRM
30
6
0
03 Nov 2023
$R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment
  Approach for NL2GQL
R3R^3R3-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL
Yuhang Zhou
Yu He
Siyu Tian
Yuchen Ni
Zhangyue Yin
...
Chuanjun Ji
Sen Liu
Xipeng Qiu
Guangnan Ye
Hongfeng Chai
32
5
0
03 Nov 2023
Large Language Models to the Rescue: Reducing the Complexity in
  Scientific Workflow Development Using ChatGPT
Large Language Models to the Rescue: Reducing the Complexity in Scientific Workflow Development Using ChatGPT
Mario Sanger
Ninon De Mecquenem
Katarzyna Ewa Lewiñska
Vasilis Bountris
Fabian Lehmann
Ulf Leser
Thomas Kosch
41
4
0
03 Nov 2023
AFPQ: Asymmetric Floating Point Quantization for LLMs
AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang
Sicheng Zhang
Shijie Cao
Dayou Du
Jianyu Wei
Ting Cao
Ningyi Xu
MQ
33
6
0
03 Nov 2023
TCM-GPT: Efficient Pre-training of Large Language Models for Domain
  Adaptation in Traditional Chinese Medicine
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine
Guoxing Yang
Jianyu Shi
Zan Wang
Xiaohong Liu
Guangyu Wang
27
17
0
03 Nov 2023
FinGPT: Large Generative Models for a Small Language
FinGPT: Large Generative Models for a Small Language
Risto Luukkonen
Ville Komulainen
Jouni Luoma
Anni Eskelinen
Jenna Kanerva
...
Mikko Merioksa
Jyrki Heinonen
Aija Vahtola
Samuel Antao
S. Pyysalo
LM&MA
28
42
0
03 Nov 2023
Market Concentration Implications of Foundation Models
Market Concentration Implications of Foundation Models
Jai Vipra
Anton Korinek
ELM
45
16
0
02 Nov 2023
Continual Learning Under Language Shift
Continual Learning Under Language Shift
Evangelia Gogoulou
Timothée Lesort
Magnus Boman
Joakim Nivre
KELM
CLL
51
4
0
02 Nov 2023
ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with
  Effective Evaluation Model
ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model
Jianghao Chen
Pu Jian
Tengxiao Xi
Yidong Yi
Qianlong Du
Chenglin Ding
Guibo Zhu
Chengqing Zong
Jinqiao Wang
Jiajun Zhang
43
7
0
02 Nov 2023
Learning A Multi-Task Transformer Via Unified And Customized Instruction
  Tuning For Chest Radiograph Interpretation
Learning A Multi-Task Transformer Via Unified And Customized Instruction Tuning For Chest Radiograph Interpretation
Lijian Xu
Ziyu Ni
Xinglong Liu
Xiaosong Wang
Hongsheng Li
Shaoting Zhang
MedIm
LM&MA
32
4
0
02 Nov 2023
Multi-dimensional data refining strategy for effective fine-tuning LLMs
Multi-dimensional data refining strategy for effective fine-tuning LLMs
Thanh Nguyen Ngoc
Q. Tran
Arthur Tang
Bao Nguyen
Thuy Nguyen
Thanh Pham
23
0
0
02 Nov 2023
Attention Alignment and Flexible Positional Embeddings Improve
  Transformer Length Extrapolation
Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
30
4
0
01 Nov 2023
CROMA: Remote Sensing Representations with Contrastive Radar-Optical
  Masked Autoencoders
CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders
A. Fuller
K. Millard
James R. Green
39
62
0
01 Nov 2023
Representativeness as a Forgotten Lesson for Multilingual and
  Code-switched Data Collection and Preparation
Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation
A. Seza Doğruöz
Sunayana Sitaram
Zheng-Xin Yong
51
13
0
31 Oct 2023
InstructCoder: Instruction Tuning Large Language Models for Code Editing
InstructCoder: Instruction Tuning Large Language Models for Code Editing
Kaixin Li
Qisheng Hu
Xu Zhao
Hui Chen
Yuxi Xie
Tiedong Liu
Qizhe Xie
Junxian He
ALM
SyDa
44
12
0
31 Oct 2023
Theory of Mind in Large Language Models: Examining Performance of 11
  State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests
Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests
Max J. van Duijn
Bram van Dijk
Tom Kouwenhoven
Werner de Valk
M. Spruit
P. V. D. Putten
ELM
LRM
33
28
0
31 Oct 2023
CreoleVal: Multilingual Multitask Benchmarks for Creoles
CreoleVal: Multilingual Multitask Benchmarks for Creoles
Heather Lent
Kushal Tatariya
Raj Dabre
Yiyi Chen
Marcell Richard Fekete
...
Miryam de Lhoneux
Daniel Hershcovich
Michel DeGraff
Anders Sogaard
Johannes Bjerva
SLR
43
9
0
30 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language
  Modeling Likewise
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
29
13
0
29 Oct 2023
FP8-LM: Training FP8 Large Language Models
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
59
40
0
27 Oct 2023
Cultural Adaptation of Recipes
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
23
5
0
26 Oct 2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing
  & Attribution in AI
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
...
K. Bollacker
Tongshuang Wu
Luis Villa
Sandy Pentland
Sara Hooker
32
56
0
25 Oct 2023
DEFT: Data Efficient Fine-Tuning for Pre-Trained Language Models via
  Unsupervised Core-Set Selection
DEFT: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection
Devleena Das
Vivek Khetan
34
1
0
25 Oct 2023
Multiple Key-value Strategy in Recommendation Systems Incorporating
  Large Language Model
Multiple Key-value Strategy in Recommendation Systems Incorporating Large Language Model
Dui Wang
Xiangyu Hou
Xiaohui Yang
Bo Zhang
Renbing Chen
Daiyue Xue
KELM
29
3
0
25 Oct 2023
LlamaRec: Two-Stage Recommendation using Large Language Models for
  Ranking
LlamaRec: Two-Stage Recommendation using Large Language Models for Ranking
Zhenrui Yue
Sara Rabhi
Gabriel de Souza P. Moreira
Dong Wang
Even Oldridge
LRM
53
36
0
25 Oct 2023
Locally Differentially Private Document Generation Using Zero Shot
  Prompting
Locally Differentially Private Document Generation Using Zero Shot Prompting
Saiteja Utpala
Sara Hooker
Pin-Yu Chen
26
38
0
24 Oct 2023
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of
  LLMs through a Global Scale Prompt Hacking Competition
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Sander Schulhoff
Jeremy Pinto
Anaum Khan
Louis-Franccois Bouchard
Chenglei Si
Svetlina Anati
Valen Tagliabue
Anson Liu Kost
Christopher Carnahan
Jordan L. Boyd-Graber
SILM
44
41
0
24 Oct 2023
E-Sparse: Boosting the Large Language Model Inference through
  Entropy-based N:M Sparsity
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
Yun Li
Lin Niu
Xipeng Zhang
Kai Liu
Jianchen Zhu
Zhanhui Kang
MoE
40
11
0
24 Oct 2023
SoK: Memorization in General-Purpose Large Language Models
SoK: Memorization in General-Purpose Large Language Models
Valentin Hartmann
Anshuman Suri
Vincent Bindschaedler
David Evans
Shruti Tople
Robert West
KELM
LLMAG
29
21
0
24 Oct 2023
MindLLM: Pre-training Lightweight Large Language Model from Scratch,
  Evaluations and Domain Applications
MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications
Yizhe Yang
Huashan Sun
Jiawei Li
Runheng Liu
Yinghao Li
Yuhang Liu
Heyan Huang
Yang Gao
ALM
LRM
16
8
0
24 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
53
23
0
24 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme
  Large Language Model Compression
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
32
4
0
24 Oct 2023
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for
  Inference Cost Reduction
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction
Junyi Liu
Liangzhi Li
Tong Xiang
Bowen Wang
Yiming Qian
41
31
0
24 Oct 2023
Breaking the Language Barrier: Improving Cross-Lingual Reasoning with
  Structured Self-Attention
Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention
Negar Foroutan
Mohammadreza Banaei
Karl Aberer
Antoine Bosselut
LRM
25
3
0
23 Oct 2023
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
Mete Ismayilzada
Debjit Paul
Syrielle Montariol
Mor Geva
Antoine Bosselut
LRM
39
5
0
23 Oct 2023
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
Shih-Chieh Dai
Aiping Xiong
Lun-Wei Ku
32
68
0
23 Oct 2023
Meta learning with language models: Challenges and opportunities in the
  classification of imbalanced text
Meta learning with language models: Challenges and opportunities in the classification of imbalanced text
Apostol T. Vassilev
Honglan Jin
Munawar Hasan
29
0
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for
  Large Language Models
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
34
29
0
23 Oct 2023
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study
  on Syllogism
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism
Mengyu Ye
Tatsuki Kuribayashi
Jun Suzuki
Goro Kobayashi
Hiroaki Funayama
LRM
36
8
0
23 Oct 2023
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple
  Experts Fine-tuning
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning
Wei Chen
Qiushi Wang
Zefei Long
Xianyin Zhang
Zhongtian Lu
...
Siyuan Wang
Jiarong Xu
Xiang Bai
Xuanjing Huang
Zhongyu Wei
83
43
0
23 Oct 2023
Geographical Erasure in Language Generation
Geographical Erasure in Language Generation
Pola Schwöbel
Jacek Golebiowski
Michele Donini
Cédric Archambeau
Danish Pruthi
26
5
0
23 Oct 2023
Conversational Recommender System and Large Language Model Are Made for
  Each Other in E-commerce Pre-sales Dialogue
Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue
Yuanxing Liu
Wei-Nan Zhang
Yifan Chen
Yuchi Zhang
Haopeng Bai
Fan Feng
Hengbin Cui
Yongbin Li
Wanxiang Che
43
21
0
23 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64
  Languages
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
43
6
0
23 Oct 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question
  Answering via Pluggable Reward-Driven Contextual Adapter
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
Haoyan Yang
Zhitao Li
Yong Zhang
Jianzong Wang
Ning Cheng
Ming Li
Jing Xiao
RALM
19
29
0
23 Oct 2023
Previous
123...181920...313233
Next