Efficient Hierarchical Domain Adaptation for Pretrained Language Models

16 December 2021
Alexandra Chronopoulou
Matthew E. Peters
Jesse Dodge

Papers citing "Efficient Hierarchical Domain Adaptation for Pretrained Language Models"

32 / 32 papers shown
Adapter-based Approaches to Knowledge-enhanced Language Models -- A Survey
Alexander Fichtl
Juraj Vladika
Georg Groh
KELM
83
0
0
25 Nov 2024
Towards Generalized Offensive Language Identification
A. Dmonte
Tejas Arya
Tharindu Ranasinghe
Marcos Zampieri
52
3
0
26 Jul 2024
Towards Modular LLMs by Building and Reusing a Library of LoRAs
O. Ostapenko
Zhan Su
E. Ponti
Laurent Charlin
Nicolas Le Roux
Matheus Pereira
Lucas Caccia
Alessandro Sordoni
MoMe
44
31
0
18 May 2024
DALLMi: Domain Adaption for LLM-based Multi-label Classifier
Miruna Betianu
Abele Malan
Marco Aldinucci
Robert Birke
Lydia Y. Chen
38
5
0
03 May 2024
Exploring the landscape of large language models: Foundations, techniques, and challenges
M. Moradi
Ke Yan
David Colwell
Matthias Samwald
Rhona Asgari
OffRL
46
1
0
18 Apr 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
William Fleshman
Aleem Khan
Marc Marone
Benjamin Van Durme
CLL
KELM
58
3
0
12 Apr 2024
AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior
Zhouhong Gu
Xiaoxuan Zhu
Haoran Guo
Lin Zhang
Yin Cai
...
Yifei Dai
Yan Gao
Yao Hu
Hongwei Feng
Yanghua Xiao
AI4CE
50
1
0
20 Mar 2024
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld
Iz Beltagy
Pete Walsh
Akshita Bhagia
Rodney Michael Kinney
...
Jesse Dodge
Kyle Lo
Luca Soldaini
Noah A. Smith
Hanna Hajishirzi
OSLM
141
358
0
01 Feb 2024
Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs
Juraj Vladika
Alexander Fichtl
Florian Matthes
KELM
24
1
0
21 Dec 2023
Paloma: A Benchmark for Evaluating Language Model Fit
Ian H. Magnusson
Akshita Bhagia
Valentin Hofmann
Luca Soldaini
A. Jha
...
Iz Beltagy
Hanna Hajishirzi
Noah A. Smith
Kyle Richardson
Jesse Dodge
132
21
0
16 Dec 2023
A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation
Jarad Forristal
Niloofar Mireshghallah
Greg Durrett
Taylor Berg-Kirkpatrick
118
4
0
07 Dec 2023
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
33
2
0
09 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Tom Sherborne
Naomi Saphra
Pradeep Dasigi
Hao Peng
32
4
0
05 Oct 2023
Population Expansion for Training Language Models with Private Federated Learning
Tatsuki Koga
Congzheng Song
Martin Pelikan
Mona Chitnis
FedML
24
1
0
14 Jul 2023
From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Lifan Yuan
Dehan Kong
...
Longtao Huang
H. Xue
Zhiyuan Liu
Maosong Sun
Heng Ji
AAML
ELM
27
6
0
29 May 2023
Plug-and-Play Knowledge Injection for Pre-trained Language Models
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Huadong Wang
Deming Ye
...
Xu Han
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
KELM
43
10
0
28 May 2023
Plug-and-Play Document Modules for Pre-trained Models
Chaojun Xiao
Zhengyan Zhang
Xu Han
Chi-Min Chan
Yankai Lin
Zhiyuan Liu
Xiangyang Li
Zhonghua Li
Bo Zhao
Maosong Sun
KELM
24
5
0
28 May 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters
Md Mahfuz Ibn Alam
Ruoyu Xie
Fahim Faisal
Antonios Anastasopoulos
32
3
0
25 Apr 2023
Scaling Expert Language Models with Unsupervised Domain Discovery
Suchin Gururangan
Margaret Li
M. Lewis
Weijia Shi
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoE
25
46
0
24 Mar 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
E. Ponti
MoMe
OOD
32
73
0
22 Feb 2023
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Alexander Fraser
Jesse Dodge
MoMe
26
65
0
14 Feb 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP
VLM
CLL
41
23
0
23 Jan 2023
NLPeer: A Unified Resource for the Computational Study of Peer Review
Nils Dycke
Ilia Kuznetsov
Iryna Gurevych
20
36
0
12 Nov 2022
On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey
Xu Guo
Han Yu
LM&MA
VLM
28
29
0
06 Nov 2022
M2D2: A Massively Multi-domain Language Modeling Dataset
Machel Reid
Victor Zhong
Suchin Gururangan
Luke Zettlemoyer
13
19
0
13 Oct 2022
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Margaret Li
Suchin Gururangan
Tim Dettmers
M. Lewis
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoMe
31
142
0
05 Aug 2022
Mixed-effects transformers for hierarchical adaptation
Julia White
Noah D. Goodman
Robert D. Hawkins
18
2
0
03 May 2022
Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
Fatemehsadat Mireshghallah
Kartik Goyal
Taylor Berg-Kirkpatrick
36
78
0
24 Mar 2022
Geographic Adaptation of Pretrained Language Models
Valentin Hofmann
Goran Glavaš
Nikola Ljubešić
J. Pierrehumbert
Hinrich Schütze
VLM
21
16
0
16 Mar 2022
Time Waits for No One! Analysis and Challenges of Temporal Misalignment
Kelvin Luu
Daniel Khashabi
Suchin Gururangan
Karishma Mandyam
Noah A. Smith
24
85
0
14 Nov 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
261
4,489
0
23 Jan 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018