ResearchTrend.AI
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
arXiv: 2412.04090 (v2, latest)

5 December 2024
Bingchen Li
Xin Li
Yiting Lu
Zhibo Chen

Papers citing "LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents"

50 / 56 papers shown
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: KwaiSR Dataset and Study
Xin Li
Xijun Wang
Bingchen Li
Kun Yuan
Yizhen Shao
Suhang Yao
Ming-Ting Sun
Chao Zhou
Radu Timofte
Zhibo Chen
105
15
0
21 Apr 2025
Large Language Model for Lossless Image Compression with Visual Prompts
Junhao Du
Chuqin Zhou
Ning Cao
Gang Chen
Yunuo Chen
Zhengxue Cheng
Li Song
Guo Lu
Wenjun Zhang
VLM
114
2
0
22 Feb 2025
MambaIRv2: Attentive State Space Restoration
Hang Guo
Yong Guo
Yaohua Zha
Yulun Zhang
Wenbo Li
Tao Dai
Shu-Tao Xia
Yawei Li
Mamba
202
22
0
22 Nov 2024
UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt
Xin Li
Bingchen Li
Yeying Jin
Cuiling Lan
Hanxin Zhu
Yulin Ren
Zhibo Chen
100
8
0
18 Jul 2024
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren
Xin Li
Bingchen Li
Xingrui Wang
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
DiffM
129
7
0
15 Jul 2024
LM4LV: A Frozen Large Language Model for Low-level Vision Tasks
Boyang Zheng
Jinjin Gu
Shijun Li
Chao Dong
VLM, MLLM
61
4
0
24 May 2024
SeD: Semantic-Aware Discriminator for Image Super-Resolution
Bingchen Li
Xin Li
Hanxin Zhu
Yeying Jin
Ruoyu Feng
Zhizheng Zhang
Zhibo Chen
SupR
101
24
0
29 Feb 2024
MambaIR: A Simple Baseline for Image Restoration with State-Space Model
Hang Guo
Jinmin Li
Tao Dai
Zhihao Ouyang
Xudong Ren
Shu-Tao Xia
Mamba
128
249
0
23 Feb 2024
InstructIR: High-Quality Image Restoration Following Human Instructions
Marcos V. Conde
Gregor Geigle
Radu Timofte
DiffM
129
58
0
29 Jan 2024
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Fanghua Yu
Jinjin Gu
Zheyuan Li
Jinfan Hu
Xiangtao Kong
Xintao Wang
Jingwen He
Yu Qiao
Chao Dong
111
157
0
24 Jan 2024
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image
Chunyi Li
Haoning Wu
Zicheng Zhang
Hongkun Hao
Kaiwei Zhang
Lei Bai
Xiaohong Liu
Xiongkuo Min
Weisi Lin
Guangtao Zhai
85
17
0
02 Jan 2024
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Haoning Wu
Zicheng Zhang
Weixia Zhang
Chaofeng Chen
Liang Liao
...
Wenxiu Sun
Qiong Yan
Xiongkuo Min
Guangtao Zhai
Weisi Lin
90
163
0
28 Dec 2023
Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models
Zhiyuan You
Zheyuan Li
Jinjin Gu
Zhenfei Yin
Tianfan Xue
Chao Dong
EGVM
83
42
0
14 Dec 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
89
49
0
12 Oct 2023
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM, ALM
174
2,113
0
24 Aug 2023
Dual Aggregation Transformer for Image Super-Resolution
Zheng Chen
Yulun Zhang
Jinjin Gu
Lingyu Kong
Xiaokang Yang
Feng Yu
ViT
95
189
0
07 Aug 2023
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting
Zongsheng Yue
Jianyi Wang
Chen Change Loy
DiffM
124
247
0
23 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH, ALM
559
12,138
0
18 Jul 2023
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
Raphael Schumann
Wanrong Zhu
Weixi Feng
Tsu-Jui Fu
Stefan Riezler
William Yang Wang
LM&Ro
89
71
0
12 Jul 2023
ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Jiaqi Ma
Tianheng Cheng
Guoli Wang
Qian Zhang
Xinggang Wang
Lefei Zhang
DiffM, VLM
81
48
0
23 Jun 2023
PromptIR: Prompting for All-in-One Blind Image Restoration
Vaishnav Potlapalli
Syed Waqas Zamir
Salman Khan
Fahad Shahbaz Khan
VLM
116
96
0
22 Jun 2023
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
Yao Mu
Qinglong Zhang
Mengkang Hu
Wen Wang
Mingyu Ding
Jun Jin
Bin Wang
Jifeng Dai
Yu Qiao
Ping Luo
LM&Ro, LRM
116
245
0
24 May 2023
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Pan Lu
Baolin Peng
Hao Cheng
Michel Galley
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Jianfeng Gao
KELM, MLLM, LRM
157
326
0
19 Apr 2023
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM, LRM
122
232
0
10 Apr 2023
Generative Diffusion Prior for Unified Image Restoration and Enhancement
Ben Fei
Zhaoyang Lyu
Liang Pan
Junzhe Zhang
Weidong Yang
Tian-jian Luo
Bo Zhang
Bo Dai
DiffM
148
193
0
03 Apr 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
160
913
0
30 Mar 2023
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
E. Azarnasab
Faisal Ahmed
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
ReLM, KELM, LRM
124
397
0
20 Mar 2023
Reflexion: Language Agents with Verbal Reinforcement Learning
Noah Shinn
Federico Cassano
Beck Labash
A. Gopinath
Karthik Narasimhan
Shunyu Yao
LLMAG, KELM
154
1,330
0
20 Mar 2023
DiffIR: Efficient Diffusion Model for Image Restoration
Bin Xia
Yulun Zhang
Shiyin Wang
Yitong Wang
Xing Wu
Yapeng Tian
Wenming Yang
Luc Van Gool
DiffM
126
229
0
16 Mar 2023
ViperGPT: Visual Inference via Python Execution for Reasoning
Dídac Surís
Sachit Menon
Carl Vondrick
MLLM, LRM, ReLM
136
469
0
14 Mar 2023
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
Yawei Li
Yuchen Fan
Xiaoyu Xiang
D. Demandolx
Rakesh Ranjan
Radu Timofte
Luc Van Gool
105
182
0
01 Mar 2023
Augmented Language Models: a Survey
Grégoire Mialon
Roberto Dessì
Maria Lomeli
Christoforos Nalmpantis
Ramakanth Pasunuru
...
Jane Dwivedi-Yu
Asli Celikyilmaz
Edouard Grave
Yann LeCun
Thomas Scialom
LRM, KELM
102
394
0
15 Feb 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
Timo Schick
Jane Dwivedi-Yu
Roberto Dessì
Roberta Raileanu
Maria Lomeli
Luke Zettlemoyer
Nicola Cancedda
Thomas Scialom
SyDa, RALM
222
1,781
0
09 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM, MLLM
631
4,679
0
30 Jan 2023
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method
Tao Wang
Kaihao Zhang
Tianrun Shen
Wenhan Luo
B. Stenger
Tong Lu
3DV, ViT
86
284
0
22 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM, MLLM
166
262
0
05 Dec 2022
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLM, VLM, LRM
183
440
0
18 Nov 2022
Exploring CLIP for Assessing the Look and Feel of Images
Jianyi Wang
Kelvin C. K. Chan
Chen Change Loy
VLM
171
586
0
25 Jul 2022
Activating More Pixels in Image Super-Resolution Transformer
Xiangyu Chen
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
ViT
177
647
0
09 May 2022
MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Sidi Yang
Tianhe Wu
Shu Shi
Shanshan Lao
S. Gong
Ming Cao
Jiahao Wang
Yujiu Yang
133
344
0
19 Apr 2022
Restormer: Efficient Transformer for High-Resolution Image Restoration
Syed Waqas Zamir
Aditya Arora
Salman Khan
Munawar Hayat
Fahad Shahbaz Khan
Ming-Hsuan Yang
ViT
222
2,286
0
18 Nov 2021
SwinIR: Image Restoration Using Swin Transformer
Jingyun Liang
Jiezhang Cao
Guolei Sun
Kai Zhang
Luc Van Gool
Radu Timofte
ViT
200
2,988
0
23 Aug 2021
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data
Xintao Wang
Liangbin Xie
Chao Dong
Ying Shan
135
1,190
0
22 Jul 2021
Designing a Practical Degradation Model for Deep Blind Image Super-Resolution
Kai Zhang
Jingyun Liang
Luc Van Gool
Radu Timofte
201
603
0
25 Mar 2021
Learning Texture Transformer Network for Image Super-Resolution
Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
B. Guo
SupR, ViT
102
732
0
07 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.2K
42,753
0
28 May 2020
ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks
Xintao Wang
Ke Yu
Shixiang Wu
Jinjin Gu
Yihao Liu
Chao Dong
Chen Change Loy
Yu Qiao
Xiaoou Tang
362
3,757
0
01 Sep 2018
Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform
Xintao Wang
K. Yu
Chao Dong
Chen Change Loy
SupR
130
989
0
09 Apr 2018
Residual Dense Network for Image Super-Resolution
Yulun Zhang
Yapeng Tian
Yu Kong
Bineng Zhong
Y. Fu
SupR
238
3,343
0
24 Feb 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
442
11,996
0
11 Jan 2018