ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.04359
  4. Cited By
Ethical and social risks of harm from Language Models

Ethical and social risks of harm from Language Models

8 December 2021
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
Po-Sen Huang
Myra Cheng
Mia Glaese
Borja Balle
Atoosa Kasirzadeh
Zachary Kenton
S. Brown
Will Hawkins
T. Stepleton
Courtney Biles
Abeba Birhane
Julia Haas
Laura Rimell
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
    PILM
ArXiv (abs)PDFHTML

Papers citing "Ethical and social risks of harm from Language Models"

50 / 634 papers shown
Title
Will Code Remain a Relevant User Interface for End-User Programming with
  Generative AI Models?
Will Code Remain a Relevant User Interface for End-User Programming with Generative AI Models?
Advait Sarkar
74
19
0
01 Nov 2023
Sentiment Analysis in Digital Spaces: An Overview of Reviews
Sentiment Analysis in Digital Spaces: An Overview of Reviews
L. Ayravainen
Joanne Hinds
Brittany I. Davidson
84
0
0
30 Oct 2023
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
A. Mukherjee
Chahat Raj
Ziwei Zhu
Antonios Anastasopoulos
85
17
0
26 Oct 2023
Unpacking the Ethical Value Alignment in Big Models
Unpacking the Ethical Value Alignment in Big Models
Xiaoyuan Yi
Jing Yao
Xiting Wang
Xing Xie
79
13
0
26 Oct 2023
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in
  Interactions
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Hyunwoo J. Kim
Melanie Sclar
Xuhui Zhou
Ronan Le Bras
Gunhee Kim
Yejin Choi
Maarten Sap
LLMAG
84
92
0
24 Oct 2023
Synergizing Human-AI Agency: A Guide of 23 Heuristics for Service
  Co-Creation with LLM-Based Agents
Synergizing Human-AI Agency: A Guide of 23 Heuristics for Service Co-Creation with LLM-Based Agents
Qingxiao Zheng
Zhongwei Xu
Abhinav Choudhary
Yuting Chen
Yongming Li
Yun Huang
LLMAG
86
7
0
23 Oct 2023
NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling
  Social Norm Adherence and Violation
NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation
Aochong Li
Mallika Subramanian
Arkadiy Saakyan
Sky CH-Wang
Smaranda Muresan
76
14
0
23 Oct 2023
Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research
Values, Ethics, Morals? On the Use of Moral Concepts in NLP Research
Karina Vida
Judith Simon
Anne Lauscher
73
18
0
21 Oct 2023
Safe RLHF: Safe Reinforcement Learning from Human Feedback
Safe RLHF: Safe Reinforcement Learning from Human Feedback
Josef Dai
Xuehai Pan
Ruiyang Sun
Jiaming Ji
Xinbo Xu
Mickel Liu
Yizhou Wang
Yaodong Yang
141
364
0
19 Oct 2023
Identifying and Adapting Transformer-Components Responsible for Gender
  Bias in an English Language Model
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model
Abhijith Chintam
Rahel Beloch
Willem H. Zuidema
Michael Hanna
Oskar van der Wal
80
18
0
19 Oct 2023
Privacy Preserving Large Language Models: ChatGPT Case Study Based
  Vision and Framework
Privacy Preserving Large Language Models: ChatGPT Case Study Based Vision and Framework
Imdad Ullah
Najm Hassan
S. Gill
Basem Suleiman
T. Ahanger
Zawar Shah
Junaid Qadir
S. Kanhere
92
17
0
19 Oct 2023
Denevil: Towards Deciphering and Navigating the Ethical Values of Large
  Language Models via Instruction Learning
Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning
Shitong Duan
Xiaoyuan Yi
Peng Zhang
Tun Lu
Xing Xie
Ning Gu
90
12
0
17 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
126
19
0
15 Oct 2023
Case Law Grounding: Aligning Judgments of Humans and AI on
  Socially-Constructed Concepts
Case Law Grounding: Aligning Judgments of Humans and AI on Socially-Constructed Concepts
Quan Ze Chen
Amy X. Zhang
ELM
128
2
0
10 Oct 2023
Teaching Language Models to Hallucinate Less with Synthetic Tasks
Teaching Language Models to Hallucinate Less with Synthetic Tasks
Erik Jones
Hamid Palangi
Clarisse Simoes
Varun Chandrasekaran
Subhabrata Mukherjee
Arindam Mitra
Ahmed Hassan Awadallah
Ece Kamar
HILM
87
27
0
10 Oct 2023
Anticipating Impacts: Using Large-Scale Scenario Writing to Explore
  Diverse Implications of Generative AI in the News Environment
Anticipating Impacts: Using Large-Scale Scenario Writing to Explore Diverse Implications of Generative AI in the News Environment
Kimon Kieslich
Nicholas Diakopoulos
Natali Helberger
71
18
0
10 Oct 2023
Rephrase, Augment, Reason: Visual Grounding of Questions for
  Vision-Language Models
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
ReLMLRM
74
8
0
09 Oct 2023
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text
  Generation
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Abe Bohan Hou
Jingyu Zhang
Tianxing He
Yichen Wang
Yung-Sung Chuang
Hongwei Wang
Lingfeng Shen
Benjamin Van Durme
Daniel Khashabi
Yulia Tsvetkov
WaLM
92
0
0
06 Oct 2023
Assessing Large Language Models on Climate Information
Assessing Large Language Models on Climate Information
Jannis Bulian
Mike S. Schäfer
Afra Amini
Heidi Lam
Massimiliano Ciaramita
...
Michelle Chen Huebscher
Christian Buck
Niels G. Mede
Markus Leippold
Nadine Strauss
ELM
81
22
0
04 Oct 2023
Low-Resource Languages Jailbreak GPT-4
Low-Resource Languages Jailbreak GPT-4
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
SILM
122
205
0
03 Oct 2023
LoFT: Local Proxy Fine-tuning For Improving Transferability Of
  Adversarial Attacks Against Large Language Model
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Muhammad Ahmed Shah
Roshan S. Sharma
Hira Dhamyal
R. Olivier
Ankit Shah
...
Massa Baali
Soham Deshmukh
Michael Kuhlmann
Bhiksha Raj
Rita Singh
AAML
67
21
0
02 Oct 2023
All Languages Matter: On the Multilingual Safety of Large Language
  Models
All Languages Matter: On the Multilingual Safety of Large Language Models
Wenxuan Wang
Zhaopeng Tu
Chang Chen
Youliang Yuan
Jen-tse Huang
Wenxiang Jiao
Michael R. Lyu
ALMLRM
98
34
0
02 Oct 2023
No Offense Taken: Eliciting Offensiveness from Language Models
No Offense Taken: Eliciting Offensiveness from Language Models
Anugya Srivastava
Rahul Ahuja
Rohith Mukku
46
3
0
02 Oct 2023
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending
  Against Extraction Attacks
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
Vaidehi Patil
Peter Hase
Joey Tianyi Zhou
KELMAAML
136
108
0
29 Sep 2023
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks,
  benefits, and alternative methods for pursuing open-source objectives
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
Elizabeth Seger
Noemi Dreksler
Richard Moulange
Emily Dardaman
Jonas Schuett
...
Emma Bluemke
Michael Aird
Patrick Levermore
Julian Hazell
Abhishek Gupta
74
43
0
29 Sep 2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a
  Haystack
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
...
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
VLM
91
216
0
27 Sep 2023
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking
  Unrelated Questions
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Lorenzo Pacchiardi
A. J. Chan
Sören Mindermann
Ilan Moscovitz
Alexa Y. Pan
Y. Gal
Owain Evans
J. Brauner
LLMAGHILM
80
54
0
26 Sep 2023
More than Model Documentation: Uncovering Teachers' Bespoke Information
  Needs for Informed Classroom Integration of ChatGPT
More than Model Documentation: Uncovering Teachers' Bespoke Information Needs for Informed Classroom Integration of ChatGPT
Mei Tan
Hariharan Subramonyam
102
19
0
25 Sep 2023
Can LLM-Generated Misinformation Be Detected?
Can LLM-Generated Misinformation Be Detected?
Canyu Chen
Kai Shu
DeLMO
191
182
0
25 Sep 2023
Probing the Moral Development of Large Language Models through Defining
  Issues Test
Probing the Moral Development of Large Language Models through Defining Issues Test
Kumar Tanmay
Aditi Khandelwal
Utkarsh Agarwal
Monojit Choudhury
LRM
58
16
0
23 Sep 2023
Using ChatGPT in HCI Research -- A Trioethnography
Using ChatGPT in HCI Research -- A Trioethnography
Smit Desai
Tanusree Sharma
Pratyasha Saha
28
13
0
22 Sep 2023
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
Chengyuan Liu
Fubang Zhao
Lizhi Qing
Yangyang Kang
Changlong Sun
Kun Kuang
Leilei Gan
AAML
75
21
0
21 Sep 2023
"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure
  Risks and Benefits When Using LLM-Based Conversational Agents
"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents
Zhiping Zhang
Michelle Jia
Hao-Ping Lee
Bingsheng Yao
Sauvik Das
Ada Lerner
Dakuo Wang
Tianshi Li
SILMELM
82
81
0
20 Sep 2023
The Role of Inclusion, Control, and Ownership in Workplace AI-Mediated
  Communication
The Role of Inclusion, Control, and Ownership in Workplace AI-Mediated Communication
Kowe Kadoma
Marianne Aubin Le Quere
Jenny Fu
Christin Munsch
D. Metaxa
Mor Naaman
64
14
0
20 Sep 2023
AI (r)evolution -- where are we heading? Thoughts about the future of
  music and sound technologies in the era of deep learning
AI (r)evolution -- where are we heading? Thoughts about the future of music and sound technologies in the era of deep learning
Giovanni Bindi
Nils Demerlé
Rodrigo Diaz
David Genova
Aliénor Golvet
...
Yixiao Zhang
Axel Roebel
Nick Bryan-Kinns
Jean-Louis Giavitto
M. Barthet
29
0
0
20 Sep 2023
Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the
  Ocean, the Brazilian Coast, and Climate Change
Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the Ocean, the Brazilian Coast, and Climate Change
Paulo Pirozelli
M. M. José
I. Silveira
Flávio Nakasato
S. M. Peres
A. Brandão
Anna H. R. Costa
Fabio Gagliardi Cozman
RALM
72
4
0
19 Sep 2023
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large
  Language Models in 167 Languages
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Thuat Nguyen
Chien Van Nguyen
Viet Dac Lai
Hieu Man
Nghia Trung Ngo
Franck Dernoncourt
Ryan Rossi
Thien Huu Nguyen
104
112
0
17 Sep 2023
Fake News Detectors are Biased against Texts Generated by Large Language
  Models
Fake News Detectors are Biased against Texts Generated by Large Language Models
Jinyan Su
Terry Yue Zhuo
Jonibek Mansurov
Di Wang
Preslav Nakov
DeLMO
60
17
0
15 Sep 2023
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language
  Models that Follow Instructions
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Federico Bianchi
Mirac Suzgun
Giuseppe Attanasio
Paul Röttger
Dan Jurafsky
Tatsunori Hashimoto
James Zou
ALMLM&MALRM
89
219
0
14 Sep 2023
Generative AI Text Classification using Ensemble LLM Approaches
Generative AI Text Classification using Ensemble LLM Approaches
Harika Abburi
Michael Suesserman
Nirmala Pudota
Balaji Veeramani
Edward Bowen
Sanmitra Bhattacharya
DeLMO
68
54
0
14 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
SafetyBench: Evaluating the Safety of Large Language Models
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Minlie Huang
LRMLM&MAELM
129
112
0
13 Sep 2023
Beyond Traditional Teaching: The Potential of Large Language Models and
  Chatbots in Graduate Engineering Education
Beyond Traditional Teaching: The Potential of Large Language Models and Chatbots in Graduate Engineering Education
M. Abedi
Ibrahem Alshybani
M. Shahadat
M. Murillo
107
15
0
09 Sep 2023
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Sneha Kudugunta
Isaac Caswell
Biao Zhang
Xavier Garcia
Christopher A. Choquette-Choo
...
Derrick Xin
Aditya Kusupati
Romi Stella
Ankur Bapna
Orhan Firat
134
141
0
09 Sep 2023
Can NLP Models Ídentify', 'Distinguish', and 'Justify' Questions that
  Don't have a Definitive Answer?
Can NLP Models Ídentify', 'Distinguish', and 'Justify' Questions that Don't have a Definitive Answer?
Ayushi Agarwal
Nisarg Patel
Neeraj Varshney
Mihir Parmar
Pavan Mallina
Aryan Bhavin Shah
Srihari Sangaraju
Tirth Patel
Nihar Thakkar
Chitta Baral
ELM
62
4
0
08 Sep 2023
SHAPE: A Framework for Evaluating the Ethicality of Influence
SHAPE: A Framework for Evaluating the Ethicality of Influence
Elfia Bezou-Vrakatseli
Benedikt Brückner
Luke Thorburn
TDI
65
3
0
08 Sep 2023
Explainability for Large Language Models: A Survey
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jundong Li
LRM
106
470
0
02 Sep 2023
Large language models in medicine: the potentials and pitfalls
Large language models in medicine: the potentials and pitfalls
J. Omiye
Haiwen Gui
Shawheen J. Rezaei
James Zou
Roxana Daneshjou
LM&MA
97
81
0
31 Aug 2023
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open
  Generative Large Language Models
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Neha Sengupta
Sunil Kumar Sahu
Bokang Jia
Satheesh Katipomu
Haonan Li
...
A. Jackson
Hector Xuguang Ren
Preslav Nakov
Timothy Baldwin
Eric P. Xing
LRM
101
41
0
30 Aug 2023
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through
  the Lens of Moral Theories?
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou
Minda Hu
Junan Li
Xiaoying Zhang
Xixin Wu
Irwin King
Helen M. Meng
LRM
89
29
0
29 Aug 2023
Challenges of GPT-3-based Conversational Agents for Healthcare
Challenges of GPT-3-based Conversational Agents for Healthcare
Fabian Lechner
Allison Lahnala
Charles F Welch
Lucie Flek
LM&MA
53
2
0
28 Aug 2023
Previous
123...789...111213
Next