ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.06732
  4. Cited By
Sequence Level Training with Recurrent Neural Networks

Sequence Level Training with Recurrent Neural Networks

20 November 2015
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
ArXivPDFHTML

Papers citing "Sequence Level Training with Recurrent Neural Networks"

50 / 349 papers shown
Title
BLEUBERI: BLEU is a surprisingly effective reward for instruction following
BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Yapei Chang
Yekyung Kim
Michael Krumdick
Amir Zadeh
Chuan Li
Chris Tanner
Mohit Iyyer
ALM
22
0
0
16 May 2025
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
M. Wong
C. Tan
ALM
83
4
0
19 Mar 2025
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
Sucheng Ren
Qihang Yu
Ju He
Xiaohui Shen
Alan Yuille
Liang-Chieh Chen
VGen
83
6
0
27 Feb 2025
BRIDO: Bringing Democratic Order to Abstractive Summarization
BRIDO: Bringing Democratic Order to Abstractive Summarization
Junhyun Lee
Harshith Goka
Hyeonmok Ko
HILM
54
0
0
25 Feb 2025
A Fokker-Planck-Based Loss Function that Bridges Dynamics with Density Estimation
A Fokker-Planck-Based Loss Function that Bridges Dynamics with Density Estimation
Zhixin Lu
Łukasz Kuśmierz
Stefan Mihalas
70
0
0
24 Feb 2025
Sequence-level Large Language Model Training with Contrastive Preference Optimization
Sequence-level Large Language Model Training with Contrastive Preference Optimization
Zhili Feng
Dhananjay Ram
Cole Hawkins
Aditya Rawal
Jinman Zhao
Sheng Zha
62
0
0
23 Feb 2025
Decoupled Sequence and Structure Generation for Realistic Antibody Design
Decoupled Sequence and Structure Generation for Realistic Antibody Design
Nayoung Kim
Minsu Kim
Sungsoo Ahn
Jinkyoo Park
54
0
0
20 Jan 2025
Investigating Length Issues in Document-level Machine Translation
Investigating Length Issues in Document-level Machine Translation
Ziqian Peng
Rachel Bawden
François Yvon
69
1
0
23 Dec 2024
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
33
1
0
08 Nov 2024
Heterogeneous Interaction Modeling With Reduced Accumulated Error for
  Multi-Agent Trajectory Prediction
Heterogeneous Interaction Modeling With Reduced Accumulated Error for Multi-Agent Trajectory Prediction
Siyuan Chen
Jiahai Wang
38
10
0
28 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
34
3
0
18 Oct 2024
The Mystery of the Pathological Path-star Task for Language Models
The Mystery of the Pathological Path-star Task for Language Models
Arvid Frydenlund
LRM
27
4
0
17 Oct 2024
Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation
Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation
Hui Ma
Bo Zhang
Bo Xu
Jian Wang
Hongfei Lin
Xiao Sun
57
1
0
06 Aug 2024
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in
  the Era of Large Language Models
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu
Ziliang Pang
Min Xiao
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
52
18
0
08 Jul 2024
Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories
Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories
Tianlong Wang
Xianfeng Jiao
Yifan He
Zhongzhi Chen
Yinghao Zhu
Xu Chu
Junyi Gao
Yasha Wang
Liantao Ma
LLMSV
71
7
0
26 May 2024
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Ziqiao Ma
Zekun Wang
Joyce Chai
60
2
0
22 May 2024
What Have We Achieved on Non-autoregressive Translation?
What Have We Achieved on Non-autoregressive Translation?
Yafu Li
Huajian Zhang
Jianhao Yan
Yongjing Yin
Yue Zhang
33
1
0
21 May 2024
Reinforcement Learning-Guided Semi-Supervised Learning
Reinforcement Learning-Guided Semi-Supervised Learning
Marzi Heidari
Hanping Zhang
Yuhong Guo
OffRL
39
0
0
02 May 2024
Polarity Calibration for Opinion Summarization
Polarity Calibration for Opinion Summarization
Yuanyuan Lei
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
Ruihong Huang
Dong Yu
38
0
0
02 Apr 2024
The pitfalls of next-token prediction
The pitfalls of next-token prediction
Gregor Bachmann
Vaishnavh Nagarajan
37
63
0
11 Mar 2024
On the Challenges and Opportunities in Generative AI
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
56
17
0
28 Feb 2024
Retrieval is Accurate Generation
Retrieval is Accurate Generation
Bowen Cao
Deng Cai
Leyang Cui
Xuxin Cheng
Wei Bi
Yuexian Zou
Shuming Shi
40
6
0
27 Feb 2024
Hallucination is Inevitable: An Innate Limitation of Large Language Models
Hallucination is Inevitable: An Innate Limitation of Large Language Models
Ziwei Xu
Sanjay Jain
Mohan S. Kankanhalli
HILM
LRM
71
218
0
22 Jan 2024
Reasons to Reject? Aligning Language Models with Judgments
Reasons to Reject? Aligning Language Models with Judgments
Weiwen Xu
Deng Cai
Zhisong Zhang
Wai Lam
Shuming Shi
ALM
21
14
0
22 Dec 2023
A Systematic Review of Deep Learning-based Research on Radiology Report
  Generation
A Systematic Review of Deep Learning-based Research on Radiology Report Generation
Chang Liu
Yuanhe Tian
Yan Song
MedIm
34
15
0
23 Nov 2023
Take One Step at a Time to Know Incremental Utility of Demonstration: An
  Analysis on Reranking for Few-Shot In-Context Learning
Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning
Kazuma Hashimoto
K. Raman
Michael Bendersky
39
2
0
16 Nov 2023
Context Consistency between Training and Testing in Simultaneous Machine
  Translation
Context Consistency between Training and Testing in Simultaneous Machine Translation
M. Zhong
Lemao Liu
Kehai Chen
Mingming Yang
Min Zhang
LRM
47
0
0
13 Nov 2023
Boosting Summarization with Normalizing Flows and Aggressive Training
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang
Xiaotong Shen
AI4CE
TPM
24
0
0
01 Nov 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
38
3
0
20 Oct 2023
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation
Benjamin Steenhoek
Michele Tufano
Neel Sundaresan
Alexey Svyatkovskiy
OffRL
ALM
55
17
0
03 Oct 2023
Exploiting the Signal-Leak Bias in Diffusion Models
Exploiting the Signal-Leak Bias in Diffusion Models
Martin Nicolas Everaert
Athanasios Fitsios
Marco Bocchio
Sami Arpa
Sabine Süsstrunk
R. Achanta
DiffM
35
25
0
27 Sep 2023
Elucidating the Exposure Bias in Diffusion Models
Elucidating the Exposure Bias in Diffusion Models
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
121
35
0
29 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
50
10
0
28 Aug 2023
ViCo: Engaging Video Comment Generation with Human Preference Rewards
ViCo: Engaging Video Comment Generation with Human Preference Rewards
Yuchong Sun
Bei Liu
Xu Chen
Ruihua Song
Jianlong Fu
VGen
22
2
0
22 Aug 2023
A Semi-Autoregressive Graph Generative Model for Dependency Graph
  Parsing
A Semi-Autoregressive Graph Generative Model for Dependency Graph Parsing
Ye Ma
Mingming Sun
P. Li
GNN
23
1
0
21 Jun 2023
Annotation-Inspired Implicit Discourse Relation Classification with
  Auxiliary Discourse Connective Generation
Annotation-Inspired Implicit Discourse Relation Classification with Auxiliary Discourse Connective Generation
Wei Liu
Michael Strube
27
15
0
10 Jun 2023
The Surprising Effectiveness of Diffusion Models for Optical Flow and
  Monocular Depth Estimation
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Saurabh Saxena
Charles Herrmann
Junhwa Hur
Abhishek Kar
Mohammad Norouzi
Deqing Sun
David J. Fleet
DiffM
44
78
0
02 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Knowledge Graph-Augmented Language Models for Knowledge-Grounded
  Dialogue Generation
Knowledge Graph-Augmented Language Models for Knowledge-Grounded Dialogue Generation
Minki Kang
Jin Myung Kwak
Jinheon Baek
Sung Ju Hwang
RALM
16
57
0
30 May 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
18
10
0
28 May 2023
Topic-Guided Self-Introduction Generation for Social Media Users
Topic-Guided Self-Introduction Generation for Social Media Users
Chunpu Xu
Jing Li
Pijian Li
Min Yang
36
0
0
24 May 2023
Utility-Probability Duality of Neural Networks
Utility-Probability Duality of Neural Networks
Bojun Huang
Fei Yuan
UQCV
35
1
0
24 May 2023
Neural Machine Translation for Code Generation
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Exploring Energy-based Language Models with Different Architectures and
  Training Methods for Speech Recognition
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
Hong Liu
Z. Lv
Zhijian Ou
Wenbo Zhao
Qing Xiao
24
0
0
22 May 2023
Balancing Lexical and Semantic Quality in Abstractive Summarization
Balancing Lexical and Semantic Quality in Abstractive Summarization
Jeewoo Sul
Y. Choi
30
4
0
17 May 2023
A Systematic Study of Knowledge Distillation for Natural Language
  Generation with Pseudo-Target Training
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
41
17
0
03 May 2023
Tuning computer vision models with task rewards
Tuning computer vision models with task rewards
André Susano Pinto
Alexander Kolesnikov
Yuge Shi
Lucas Beyer
Xiaohua Zhai
VLM
27
40
0
16 Feb 2023
A Study on ReLU and Softmax in Transformer
A Study on ReLU and Softmax in Transformer
Kai Shen
Junliang Guo
Xuejiao Tan
Siliang Tang
Rui Wang
Jiang Bian
27
53
0
13 Feb 2023
Long Text and Multi-Table Summarization: Dataset and Method
Long Text and Multi-Table Summarization: Dataset and Method
Shuaiqi Liu
Jiannong Cao
Ruosong Yang
Zhiyuan Wen
RALM
24
21
0
08 Feb 2023
Execution-based Code Generation using Deep Reinforcement Learning
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
25
52
0
31 Jan 2023
1234567
Next