ResearchTrend.AI
Towards Safe Multilingual Frontier AI

6 September 2024
Artūrs Kanepajs
Vladimir Ivanov
Richard Moulange

Papers citing "Towards Safe Multilingual Frontier AI"

8 / 8 papers shown
Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: A Systematic Scoping Review
Hongyi Yang
Fangyuan Chang
Dian Zhu
Muroi Fumie
Zhao Liu
62
22
0
28 Jan 2025
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Justin Cui
Wei-Lin Chiang
Ion Stoica
Cho-Jui Hsieh
ALM
79
45
0
31 May 2024
A Cross-Language Investigation into Jailbreak Attacks in Large Language Models
Jie Li
Yi Liu
Chongyang Liu
Ling Shi
Xiaoning Ren
Yaowen Zheng
Yang Liu
Yinxing Xue
AAML
58
26
0
30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
89
84
0
25 Jan 2024
Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework
Markus Anderljung
Everett Thornton Smith
Joe O'Brien
Lisa Soder
Ben Bucknall
Emma Bluemke
Jonas Schuett
Robert F. Trager
Lacey Strahm
Rumman Chowdhury
72
18
0
15 Nov 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
74
42
0
27 Oct 2023
All Languages Matter: On the Multilingual Safety of Large Language Models
Wenxuan Wang
Zhaopeng Tu
Chang Chen
Youliang Yuan
Jen-tse Huang
Wenxiang Jiao
Michael R. Lyu
ALM
LRM
54
32
0
02 Oct 2023
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
195
1,376
0
27 Jul 2023