Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.01432
Cited By
Semi-supervised Sequence Learning
4 November 2015
Andrew M. Dai
Quoc V. Le
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semi-supervised Sequence Learning"
50 / 177 papers shown
Title
Fine-Tuning Games: Bargaining and Adaptation for General-Purpose Models
Benjamin Laufer
Jon M. Kleinberg
Hoda Heidari
60
8
0
03 Jan 2025
Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language
Tomek Rutowski
Amir Harati
Elizabeth Shriberg
Yang Lu
Piotr Chlebek
Ricardo Oliveira
53
7
0
03 Jan 2025
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük
Xinnuo Xu
Jacqueline Maasch
Aditya V. Nori
Javier González
ReLM
LRM
190
1
0
02 Oct 2024
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen
Senmiao Wang
Zhihang Lin
Zhihang Lin
Yushun Zhang
Tian Ding
Ruoyu Sun
Ruoyu Sun
CLL
80
1
0
30 Jul 2024
Learning to Select the Best Forecasting Tasks for Clinical Outcome Prediction
Yuan Xue
Nan Du
A. Mottram
Martin G. Seneviratne
Andrew M. Dai
AI4TS
55
0
0
28 Jul 2024
Meta-Analysis with Untrusted Data
Shiva Kaul
Geoffrey J. Gordon
CML
32
1
0
12 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
71
7
1
10 Jul 2024
Sim2Real in Reconstructive Spectroscopy: Deep Learning with Augmented Device-Informed Data Simulation
Jiyi Chen
Pengyu Li
Yutong Wang
Pei-Cheng Ku
Qing Qu
35
3
0
19 Mar 2024
Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT
Aisha Khatun
Anisur Rahman
Md. Saiful Islam
Hemayet Ahmed Chowdhury
A. Tasnim
31
2
0
08 Mar 2024
Data-efficient Large Vision Models through Sequential Autoregression
Jianyuan Guo
Zhiwei Hao
Chengcheng Wang
Yehui Tang
Han Wu
Han Hu
Kai Han
Chang Xu
VLM
38
10
0
07 Feb 2024
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
28
1
0
12 Dec 2023
Deep Learning-based Sentiment Classification: A Comparative Survey
Mohammed Kayed
R. Redondo
Alhassan Mabrouk
14
41
0
12 Dec 2023
Self-Supervised Pre-Training Boosts Semantic Scene Segmentation on LiDAR Data
Mariona Carós
Ariadna Just
Santi Seguí
Jordi Vitrià
SSL
3DPC
24
3
0
05 Sep 2023
Fractional Denoising for 3D Molecular Pre-training
Shi Feng
Yuyan Ni
Yanyan Lan
Zhiming Ma
Wei-Ying Ma
DiffM
AI4CE
47
25
0
20 Jul 2023
Undecimated Wavelet Transform for Word Embedded Semantic Marginal Autoencoder in Security improvement and Denoising different Languages
S. Shreyanth
18
0
0
06 Jul 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
128
1,152
0
17 May 2023
Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks
Souvick Ghosh
Satanu Ghosh
C. Shah
25
2
0
08 May 2023
Shuffle & Divide: Contrastive Learning for Long Text
Joonseok Lee
Seongho Joe
Kyoungwon Park
Bogun Kim
Ho. Kang
Jaeseon Park
Youngjune Gwon
17
0
0
19 Apr 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
38
508
0
07 Mar 2023
MOTOR: A Time-To-Event Foundation Model For Structured Medical Records
E. Steinberg
Jason Alan Fries
Yizhe Xu
N. Shah
OOD
AI4TS
36
13
0
09 Jan 2023
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaML
AI4TS
38
6
0
27 Nov 2022
Searching for Discriminative Words in Multidimensional Continuous Feature Space
M. Sajgalík
Michal Barla
Maria Bielikova
22
2
0
26 Nov 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
35
2
0
24 Oct 2022
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning
Xingping Dong
Jianbing Shen
Ling Shao
32
7
0
27 Sep 2022
USB: A Unified Semi-supervised Learning Benchmark for Classification
Yidong Wang
Hao Chen
Yue Fan
Wangbin Sun
R. Tao
...
T. Shinozaki
Bernt Schiele
Jindong Wang
Xingxu Xie
Yue Zhang
27
113
0
12 Aug 2022
SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences
Peng Qi
Guangtao Wang
Jing Huang
24
0
0
03 Aug 2022
Neurosymbolic Repair for Low-Code Formula Languages
Rohan Bavishi
Harshit Joshi
José Pablo Cambronero Sánchez
Anna Fariha
Sumit Gulwani
Vu Le
Ivan Radicek
A. Tiwari
16
13
0
24 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
27
1
0
20 Jul 2022
ANEC: An Amharic Named Entity Corpus and Transformer Based Recognizer
Ebrahim Chekol Jibril
A. C. Tantuğ
35
7
0
02 Jul 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa
VLM
55
47
0
24 May 2022
On the Role of Bidirectionality in Language Model Pre-Training
Mikel Artetxe
Jingfei Du
Naman Goyal
Luke Zettlemoyer
Ves Stoyanov
30
16
0
24 May 2022
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
62
297
0
10 May 2022
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Oren Barkan
Edan Hauon
Avi Caciularu
Ori Katz
Itzik Malkiel
Omri Armstrong
Noam Koenigstein
34
37
0
23 Apr 2022
Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey
Kento Nozawa
Issei Sato
AI4TS
24
4
0
18 Apr 2022
What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
Thomas Wang
Adam Roberts
Daniel Hesslow
Teven Le Scao
Hyung Won Chung
Iz Beltagy
Julien Launay
Colin Raffel
39
167
0
12 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
47
573
0
01 Apr 2022
MetaMorph: Learning Universal Controllers with Transformers
Agrim Gupta
Linxi Fan
Surya Ganguli
Li Fei-Fei
LM&Ro
11
86
0
22 Mar 2022
Semi-Supervised Learning and Data Augmentation in Wearable-based Momentary Stress Detection in the Wild
Han Yu
Akane Sano
18
11
0
22 Feb 2022
Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning
Aditya Siddhant
Ankur Bapna
Orhan Firat
Yuan Cao
Mengzhao Chen
Isaac Caswell
Xavier Garcia
ELM
LRM
33
29
0
09 Jan 2022
CO-STAR: Conceptualisation of Stereotypes for Analysis and Reasoning
Teyun Kwon
Anandha Gopalan
27
2
0
01 Dec 2021
Merging Models with Fisher-Weighted Averaging
Michael Matena
Colin Raffel
FedML
MoMe
50
352
0
18 Nov 2021
Vector-quantized Image Modeling with Improved VQGAN
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViT
VLM
DRL
49
483
0
09 Oct 2021
Inferring Offensiveness In Images From Natural Language Supervision
P. Schramowski
Kristian Kersting
32
2
0
08 Oct 2021
Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP
Zhijing Jin
Julius von Kügelgen
Jingwei Ni
Tejas Vaidhya
Ayush Kaushal
Mrinmaya Sachan
Bernhard Schoelkopf
CML
38
30
0
07 Oct 2021
A Survey On Neural Word Embeddings
Erhan Sezerer
Selma Tekir
AI4TS
26
12
0
05 Oct 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mañke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
91
152
0
17 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
35
3,576
0
03 Sep 2021
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning
Linyang Li
Demin Song
Xiaonan Li
Jiehang Zeng
Ruotian Ma
Xipeng Qiu
27
135
0
31 Aug 2021
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
Zhijian Liu
Simon Stent
Jie Li
John Gideon
Song Han
VLM
25
10
0
26 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
36
25
0
05 Aug 2021
1
2
3
4
Next