ResearchTrend.AI
A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy


17 January 2025
Huandong Wang
Wenjie Fu
Yingzhou Tang
Zhilong Chen
Yanhua Huang
J. Piao
Chen Gao
Fengli Xu
Tao Jiang
Yongqian Li
    PILM

Papers citing "A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy"

5 / 5 papers shown
Understanding How Value Neurons Shape the Generation of Specified Values in LLMs
Yi Su
Jiayi Zhang
Shu Yang
Xinhai Wang
Lijie Hu
Di Wang
OffRL
186
2
0
23 May 2025
IMPersona: Evaluating Individual Level LM Impersonation
Quan Shi
Carlos E. Jimenez
Stephen Dong
Brian Seo
Caden Yao
Adam Kelch
Karthik Narasimhan
58
0
0
06 Apr 2025
What Large Language Models Do Not Talk About: An Empirical Study of Moderation and Censorship Practices
Sander Noels
Guillaume Bied
Maarten Buyl
Alexander Rogiers
Yousra Fettach
Jefrey Lijffijt
Tijl De Bie
101
1
0
04 Apr 2025
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies
Luyi Jiang
Jiasi Chen
Lu Lu
Xinwei Peng
Lihao Liu
Junjun He
Jie Xu
ELM, LM&MA
80
0
0
10 Mar 2025
Membership Inference Risks in Quantized Models: A Theoretical and Empirical Study
Eric Aubinais
Philippe Formont
Pablo Piantanida
Elisabeth Gassiat
112
1
0
10 Feb 2025