1611.01578
Neural Architecture Search with Reinforcement Learning
5 November 2016
Barret Zoph
Quoc V. Le
Papers citing "Neural Architecture Search with Reinforcement Learning"
50 / 75 papers shown
Title
RBFleX-NAS: Training-Free Neural Architecture Search Using Radial Basis Function Kernel and Hyperparameter Detection
Tomomasa Yamasaki
Zhehui Wang
Yaoyu Zhang
Niangjun Chen
Bo Wang
220
0
0
26 Mar 2025
Moss: Proxy Model-based Full-Weight Aggregation in Federated Learning with Heterogeneous Models
Y. Cai
Ziqi Zhang
Ding Li
Yao Guo
Xiangqun Chen
159
0
0
13 Mar 2025
Transfer Learning with Pre-trained Conditional Generative Models
Shin'ya Yamaguchi
Sekitoshi Kanai
Atsutoshi Kumagai
Daiki Chijiwa
H. Kashima
VLM
CLL
BDL
DiffM
249
5
0
21 Feb 2025
NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance
Raphael T. Husistein
Markus Reiher
Marco Eckhoff
248
1
0
20 Feb 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
192
2
0
28 Jan 2025
Evolutionary Optimization of Model Merging Recipes
Takuya Akiba
Makoto Shing
Yujin Tang
Qi Sun
David Ha
MoMe
297
125
0
28 Jan 2025
Robust Guided Diffusion for Offline Black-Box Optimization
Can Chen
Christopher Beckham
Xue Liu
OffRL
129
5
0
03 Jan 2025
Causal-aware Graph Neural Architecture Search under Distribution Shifts
Peiwen Li
Xin Wang
Zeyang Zhang
Yi Qin
Ziwei Zhang
Jialong Wang
Yang Li
Wenwu Zhu
CML
OOD
144
4
0
31 Dec 2024
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
312
709
0
31 Dec 2024
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Akhiad Bercovich
Tomer Ronen
Talor Abramovich
Nir Ailon
Nave Assaf
...
Ido Shahaf
Oren Tropp
Omer Ullman Argov
Ran Zilberstein
Ran El-Yaniv
205
3
0
28 Nov 2024
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization
Zeyuan Ma
Hongshu Guo
Yue-Jiao Gong
Jun Zhang
Kay Chen Tan
296
5
0
01 Nov 2024
EM-DARTS: Hierarchical Differentiable Architecture Search for Eye Movement Recognition
Huafeng Qin
Hongyu Zhu
Xin Jin
Xin Yu
M. El-Yacoubi
Xinbo Gao
120
0
0
22 Sep 2024
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval
Amirreza Mahbod
Nematollah Saeidi
Sepideh Hatamikia
Ramona Woitek
VLM
MedIm
108
3
0
14 Sep 2024
Layerwise Recurrent Router for Mixture-of-Experts
Zihan Qiu
Zeyu Huang
Shuang Cheng
Yizhi Zhou
Zili Wang
Ivan Titov
Jie Fu
MoE
149
2
0
13 Aug 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
145
5
0
17 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
110
0
0
10 Jun 2024
OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference
Dujian Ding
Bicheng Xu
L. Lakshmanan
VLM
112
2
0
06 Jun 2024
Design Editing for Offline Model-based Optimization
Ye Yuan
Youyuan Zhang
Can Chen
Haolun Wu
Zixuan Li
Jianmo Li
James J. Clark
Xue Liu
98
5
0
22 May 2024
Data Augmentation Policy Search for Long-Term Forecasting
Liran Nochumsohn
Omri Azencot
AI4TS
TPM
139
5
0
01 May 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
165
10
0
29 Feb 2024
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
135
1
0
28 Feb 2024
CBQ: Cross-Block Quantization for Large Language Models
Xin Ding
Xiaoyu Liu
Zhijun Tu
Yun-feng Zhang
Wei Li
...
Hanting Chen
Yehui Tang
Zhiwei Xiong
Baoqun Yin
Yunhe Wang
MQ
111
17
0
13 Dec 2023
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Jianlei Yang
Jiacheng Liao
Fanding Lei
Meichen Liu
Junyi Chen
Lingkun Long
Han Wan
Bei Yu
Weisheng Zhao
MoE
127
2
0
03 Nov 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
167
13
0
28 Aug 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
193
0
0
23 May 2023
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
129
24
0
19 Nov 2021
InstantNet: Automated Generation and Deployment of Instantaneously Switchable-Precision Networks
Yonggan Fu
Zhongzhi Yu
Yongan Zhang
Yi Ding
Chaojian Li
Yongyuan Liang
Mingchao Jiang
Zhangyang Wang
Yingyan Lin
85
3
0
22 Apr 2021
DrNAS: Dirichlet Neural Architecture Search
Xiangning Chen
Ruochen Wang
Minhao Cheng
Xiaocheng Tang
Cho-Jui Hsieh
OOD
82
103
0
18 Jun 2020
Multi-fidelity Neural Architecture Search with Knowledge Distillation
I. Trofimov
Nikita Klyuchnikov
Mikhail Salnikov
Alexander N. Filippov
Evgeny Burnaev
99
15
0
15 Jun 2020
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System
Pengyu Zhao
Kecheng Xiao
Yuanxing Zhang
Kaigui Bian
Wei Yan
102
16
0
10 Jun 2020
Differentiable Neural Input Search for Recommender Systems
Weiyu Cheng
Yanyan Shen
Linpeng Huang
71
36
0
08 Jun 2020
Cryptanalytic Extraction of Neural Network Models
Nicholas Carlini
Matthew Jagielski
Ilya Mironov
FedML
MLAU
MIACV
AAML
153
137
0
10 Mar 2020
Learning to reinforcement learn for Neural Architecture Search
J. Gomez
Joaquin Vanschoren
69
8
0
09 Nov 2019
Device-Circuit-Architecture Co-Exploration for Computing-in-Memory Neural Accelerators
Weiwen Jiang
Qiuwen Lou
Zheyu Yan
Lei Yang
Jiaxi Hu
X. S. Hu
Yiyu Shi
96
72
0
31 Oct 2019
Fast-Slow Recurrent Neural Networks
Asier Mujika
Florian Meier
Angelika Steger
94
77
0
24 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDL
OffRL
84
120
0
14 May 2017
Genetic CNN
Lingxi Xie
Alan Yuille
3DV
132
847
0
04 Mar 2017
Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling
Hakan Inan
Khashayar Khosravi
R. Socher
130
385
0
04 Nov 2016
HyperNetworks
David R Ha
Andrew M. Dai
Quoc V. Le
178
1,633
0
27 Sep 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
920
6,799
0
26 Sep 2016
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
352
2,901
0
26 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
887
36,910
0
25 Aug 2016
Using the Output Embedding to Improve Language Models
Ofir Press
Lior Wolf
104
738
0
20 Aug 2016
Recurrent Highway Networks
J. Zilly
R. Srivastava
Jan Koutník
Jürgen Schmidhuber
120
418
0
12 Jul 2016
Learning to learn by gradient descent by gradient descent
Marcin Andrychowicz
Misha Denil
Sergio Gomez Colmenarejo
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
134
2,009
0
14 Jun 2016
Convolutional Neural Fabrics
Shreyas Saxena
Jakob Verbeek
86
226
0
08 Jun 2016
Learning to Optimize
Ke Li
Jitendra Malik
63
258
0
06 Jun 2016
FractalNet: Ultra-Deep Neural Networks without Residuals
Gustav Larsson
Michael Maire
Gregory Shakhnarovich
165
941
0
24 May 2016
Wide Residual Networks
Sergey Zagoruyko
N. Komodakis
362
8,005
0
23 May 2016
Deep Networks with Stochastic Depth
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
219
2,365
0
30 Mar 2016