ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.09203
  4. Cited By
Release Strategies and the Social Impacts of Language Models

Release Strategies and the Social Impacts of Language Models

24 August 2019
Irene Solaiman
Miles Brundage
Jack Clark
Amanda Askell
Ariel Herbert-Voss
Jeff Wu
Alec Radford
Gretchen Krueger
Jong Wook Kim
Sarah Kreps
Miles McCain
Alex Newhouse
Jason Blazakis
Kris McGuffie
Jasmine Wang
ArXivPDFHTML

Papers citing "Release Strategies and the Social Impacts of Language Models"

50 / 121 papers shown
Title
Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction
Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction
Xiaowei Zhu
Yubing Ren
Yanan Cao
Xixun Lin
Fang Fang
Yangxi Li
45
0
0
08 May 2025
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
Chetan Pathade
AAML
SILM
59
0
0
07 May 2025
Real-World Gaps in AI Governance Research
Real-World Gaps in AI Governance Research
Ilan Strauss
Isobel Moure
Tim O'Reilly
Sruly Rosenblat
65
0
0
30 Apr 2025
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Jesus Lovon
Thouria Ben-Haddi
Jules Di Scala
José G. Moreno
L. Tamine
60
2
0
29 Apr 2025
Could AI Trace and Explain the Origins of AI-Generated Images and Text?
Could AI Trace and Explain the Origins of AI-Generated Images and Text?
Hongchao Fang
Yixin Liu
Ran Xu
Can Qin
Yong-Jin Liu
Feng Liu
Lichao Sun
Dongwon Lee
Lifu Huang
Wenpeng Yin
DeLMO
68
0
0
05 Apr 2025
Is Less Really More? Fake News Detection with Limited Information
Is Less Really More? Fake News Detection with Limited Information
Zhaoyang Cao
John Nguyen
Reza Zafarani
59
0
0
02 Apr 2025
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
Jingyi Zheng
Junfeng Wang
Zhen Sun
Wenhan Dong
Yule Liu
Xinlei He
AAML
50
0
0
10 Mar 2025
UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction
UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction
Chenyu Li
Danfeng Hong
Bing Zhang
Yuxuan Li
Gustau Camps-Valls
X. Zhu
J. Chanussot
66
1
0
24 Feb 2025
Beyond Release: Access Considerations for Generative AI Systems
Beyond Release: Access Considerations for Generative AI Systems
Irene Solaiman
Rishi Bommasani
Dan Hendrycks
Ariel Herbert-Voss
Yacine Jernite
Aviya Skowron
Andrew Trask
65
1
0
23 Feb 2025
Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing
Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing
Shoumik Saha
S. Feizi
DeLMO
70
0
0
21 Feb 2025
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection
Guangsheng Bao
Yanbin Zhao
Juncai He
Yue Zhang
VLM
96
2
0
20 Feb 2025
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
Wichayaporn Wongkamjan
Yanze Wang
Feng Gu
Denis Peskoff
Jonathan K. Kummerfeld
Jonathan May
Jordan Lee Boyd-Graber
58
0
0
18 Feb 2025
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
A. K. Kadhim
Lei Jiao
R. Shafik
Ole-Christoffer Granmo
DeLMO
74
0
0
31 Jan 2025
FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments
Zhiyuan Fu
Junfan Chen
Hongyu Sun
Ting Yang
Ruidong Li
Yuqing Zhang
49
0
0
28 Jan 2025
Can AI-Generated Text be Reliably Detected?
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
S. Feizi
DeLMO
78
365
0
20 Jan 2025
CancerKG.ORG A Web-scale, Interactive, Verifiable Knowledge Graph-LLM Hybrid for Assisting with Optimal Cancer Treatment and Care
Michael Gubanov
Anna Pyayt
Aleksandra Karolak
53
3
0
03 Jan 2025
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
Batu Guan
Yao Wan
Zhangqian Bi
Zheng Wang
Hongyu Zhang
Yulei Sui
Pan Zhou
39
8
0
31 Dec 2024
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Ekaterina Artemova
Jason Samuel Lucas
Saranya Venkatraman
Jooyoung Lee
Sergei Tilga
Adaku Uchendu
Vladislav Mikhailov
DeLMO
MoE
68
4
0
06 Nov 2024
DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xinyi Yang
Yulin Yuan
Lidia S. Chao
DeLMO
58
2
0
31 Oct 2024
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting
Can Chen
Jun-Kun Wang
DeLMO
42
0
0
29 Oct 2024
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Honglin Mu
Han He
Yuxin Zhou
Yunlong Feng
Yang Xu
...
Zeming Liu
Xudong Han
Qi Shi
Qingfu Zhu
Wanxiang Che
AAML
43
1
0
28 Oct 2024
Unveiling Large Language Models Generated Texts: A Multi-Level
  Fine-Grained Detection Framework
Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework
Zhen Tao
Zhiyu Li
Runyu Chen
Dinghao Xi
Wei Xu
DeLMO
26
1
0
18 Oct 2024
Training-free LLM-generated Text Detection by Mining Token Probability
  Sequences
Training-free LLM-generated Text Detection by Mining Token Probability Sequences
Yihuai Xu
Yongwei Wang
Yifei Bi
Huangsen Cao
Zhouhan Lin
Yu Zhao
Fei Wu
DeLMO
26
0
0
08 Oct 2024
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
Ximing Lu
Melanie Sclar
Skyler Hallinan
Niloofar Mireshghallah
Jiacheng Liu
...
Allyson Ettinger
Liwei Jiang
Khyathi Raghavi Chandu
Nouha Dziri
Yejin Choi
DeLMO
51
11
0
05 Oct 2024
Detecting Machine-Generated Long-Form Content with Latent-Space
  Variables
Detecting Machine-Generated Long-Form Content with Latent-Space Variables
Yufei Tian
Zeyu Pan
Nanyun Peng
DeLMO
31
0
0
04 Oct 2024
Personality Alignment of Large Language Models
Personality Alignment of Large Language Models
Minjun Zhu
Linyi Yang
Yue Zhang
Yue Zhang
ALM
67
5
0
21 Aug 2024
Learning to Rewrite: Generalized LLM-Generated Text Detection
Learning to Rewrite: Generalized LLM-Generated Text Detection
Wei Hao
Ran Li
Weiliang Zhao
Junfeng Yang
Chengzhi Mao
DeLMO
59
3
0
08 Aug 2024
Neural Network Emulator for Atmospheric Chemical ODE
Neural Network Emulator for Atmospheric Chemical ODE
Zhi-Song Liu
Petri S. Clusius
Michael Boy
42
3
0
03 Aug 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
79
7
0
21 Jun 2024
Watermarking Language Models with Error Correcting Codes
Watermarking Language Models with Error Correcting Codes
Patrick Chao
Yan Sun
Edgar Dobriban
Hamed Hassani
WaLM
40
3
0
12 Jun 2024
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in
  Large Language Models
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models
Jisu Shin
Hoyun Song
Huije Lee
Soyeong Jeong
Jong C. Park
38
6
0
06 Jun 2024
Ranking Manipulation for Conversational Search Engines
Ranking Manipulation for Conversational Search Engines
Samuel Pfrommer
Yatong Bai
Tanmay Gautam
Somayeh Sojoudi
SILM
47
4
0
05 Jun 2024
Transformer and Hybrid Deep Learning Based Models for Machine-Generated
  Text Detection
Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection
Teodor-George Marchitan
Claudiu Creanga
Liviu P. Dinu
DeLMO
23
1
0
28 May 2024
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Hyunseok Lee
Jihoon Tack
Jinwoo Shin
DeLMO
40
0
0
27 May 2024
ChatGPT Code Detection: Techniques for Uncovering the Source of Code
ChatGPT Code Detection: Techniques for Uncovering the Source of Code
Marc Oedingen
Raphael C. Engelhardt
Robin Denz
Maximilian Hammer
Wolfgang Konen
DeLMO
45
8
0
24 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xuebo Liu
Lidia S. Chao
Min Zhang
DeLMO
46
4
0
07 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
53
2
0
06 May 2024
MUGC: Machine Generated versus User Generated Content Detection
MUGC: Machine Generated versus User Generated Content Detection
Yaqi Xie
Anjali Rawal
Yujing Cen
Dixuan Zhao
S. K. Narang
Shanu Sushmita
DeLMO
43
3
0
28 Mar 2024
GenAI Detection Tools, Adversarial Techniques and Implications for
  Inclusivity in Higher Education
GenAI Detection Tools, Adversarial Techniques and Implications for Inclusivity in Higher Education
Mike Perkins
Jasper Roe
Binh H. Vu
Darius Postma
Don Hickerson
James McGaughran
Huy Q. Khuat British University Vietnam
DeLMO
43
19
0
28 Mar 2024
On the Societal Impact of Open Foundation Models
On the Societal Impact of Open Foundation Models
Sayash Kapoor
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Ashwin Ramaswami
...
Victor Storchan
Daniel Zhang
Daniel E. Ho
Percy Liang
Arvind Narayanan
26
54
0
27 Feb 2024
Generative Models are Self-Watermarked: Declaring Model Authentication
  through Re-Generation
Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation
Aditya Desu
Xuanli He
Qiongkai Xu
Wei Lu
WIGM
24
1
0
23 Feb 2024
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Fengqing Jiang
Zhangchen Xu
Luyao Niu
Zhen Xiang
Bhaskar Ramasubramanian
Bo Li
Radha Poovendran
49
87
0
19 Feb 2024
Machine-Generated Text Localization
Machine-Generated Text Localization
Zhongping Zhang
Wenda Qin
Bryan A. Plummer
DeLMO
36
5
0
19 Feb 2024
ALISON: Fast and Effective Stylometric Authorship Obfuscation
ALISON: Fast and Effective Stylometric Authorship Obfuscation
Eric Xing
Saranya Venkatraman
Thai V. Le
Dongwon Lee
DeLMO
22
1
0
01 Feb 2024
To Burst or Not to Burst: Generating and Quantifying Improbable Text
To Burst or Not to Burst: Generating and Quantifying Improbable Text
Kuleen Sasse
Samuel Barham
Efsun Sarioglu Kayi
Edward W. Staley
DeLMO
27
1
0
27 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
88
58
0
22 Jan 2024
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Dominik Macko
Robert Moro
Adaku Uchendu
Ivan Srba
Jason Samuel Lucas
Michiharu Yamashita
Nafis Irtiza Tripto
Dongwon Lee
Jakub Simko
Maria Bielikova
DeLMO
40
17
0
15 Jan 2024
Classification of Human- and AI-Generated Texts for English, French,
  German, and Spanish
Classification of Human- and AI-Generated Texts for English, French, German, and Spanish
Kristina Schaaff
Tim Schlippe
Lorenz Mindner
DeLMO
19
4
0
08 Dec 2023
Machine-Generated Text Detection using Deep Learning
Machine-Generated Text Detection using Deep Learning
Raghav Gaggar
Ashish Bhagchandani
Harsh V Oza
DeLMO
20
2
0
26 Nov 2023
Controlled Text Generation via Language Model Arithmetic
Controlled Text Generation via Language Model Arithmetic
Jasper Dekoninck
Marc Fischer
Luca Beurer-Kellner
Martin Vechev
31
36
0
24 Nov 2023
123
Next