Ethical and social risks of harm from Language Models

8 December 2021
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
Po-Sen Huang
Myra Cheng
Mia Glaese
Borja Balle
Atoosa Kasirzadeh
Zachary Kenton
S. Brown
Will Hawkins
T. Stepleton
Courtney Biles
Abeba Birhane
Julia Haas
Laura Rimell
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
    PILM

Papers citing "Ethical and social risks of harm from Language Models"

50 / 634 papers shown
LLM on FHIR -- Demystifying Health Records
Paul Schmiedmayer
Adrit Rao
Philipp Zagar
Vishnu Ravi
Aydin Zahedivash
Arash Fereydooni
Oliver Aalami
LM&MA
61
9
0
25 Jan 2024
Beyond Behaviorist Representational Harms: A Plan for Measurement and Mitigation
Jennifer Chien
David Danks
100
21
0
25 Jan 2024
The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support
Inhwa Song
Sachin R. Pendse
Neha Kumar
Munmun De Choudhury
AI4MH
79
17
0
25 Jan 2024
Question answering systems for health professionals at the point of care -- a systematic review
Gregory Kell
A. Roberts
Serge Umansky
Linglong Qian
Davide Ferrari
Frank Soboczenski
Byron Wallace
Nikhil Patel
Iain J. Marshall
AI4MH
76
11
0
24 Jan 2024
ARGS: Alignment as Reward-Guided Search
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
125
62
0
23 Jan 2024
From Understanding to Utilization: A Survey on Explainability for Large Language Models
Haoyan Luo
Lucia Specia
128
25
0
23 Jan 2024
Generative AI in EU Law: Liability, Privacy, Intellectual Property, and Cybersecurity
Claudio Novelli
F. Casolari
Philipp Hacker
Giorgio Spedicato
Luciano Floridi
AILaw, SILM
101
46
0
14 Jan 2024
Intention Analysis Makes LLMs A Good Jailbreak Defender
Yuqi Zhang
Liang Ding
Lefei Zhang
Dacheng Tao
LLMSV
73
29
0
12 Jan 2024
A Computational Framework for Behavioral Assessment of LLM Therapists
Yu Ying Chiu
Ashish Sharma
Inna Wanyin Lin
Tim Althoff
AI4MH
81
43
0
01 Jan 2024
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Terry Yue Zhuo
A. Zebaze
Nitchakarn Suppattarachai
Leandro von Werra
H. D. Vries
Qian Liu
Niklas Muennighoff
ALM
93
18
0
01 Jan 2024
Is Knowledge All Large Language Models Needed for Causal Reasoning?
Hengrui Cai
Shengjie Liu
Rui Song
LRM, ELM
119
13
0
30 Dec 2023
Structured Packing in LLM Training Improves Long Context Utilization
Konrad Staniszewski
Szymon Tworkowski
Sebastian Jaszczur
Yu Zhao
Henryk Michalewski
Łukasz Kuciński
Piotr Miłoś
136
13
0
28 Dec 2023
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu
Steffi Chern
Ethan Chern
Ge Zhang
Zekun Wang
Ruibo Liu
Jing Li
Jie Fu
Pengfei Liu
73
20
0
26 Dec 2023
From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models
Wolfgang Messner
Tatum Greene
Josephine Matalone
74
5
0
21 Dec 2023
NLP for Maternal Healthcare: Perspectives and Guiding Principles in the Age of LLMs
Maria Antoniak
Aakanksha Naik
Carla S. Alvarado
Lucy Lu Wang
Irene Y. Chen
AILaw
106
17
0
19 Dec 2023
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
157
54
0
18 Dec 2023
Towards Designing a Question-Answering Chatbot for Online News: Understanding Questions and Perspectives
Md. Naimul Hoque
Ayman A Mahfuz
Mayukha Kindi
Naeemul Hassan
62
5
0
17 Dec 2023
Neurosymbolic Value-Inspired AI (Why, What, and How)
Amit P. Sheth
Kaushik Roy
54
5
0
15 Dec 2023
Multilingual large language models leak human stereotypes across language boundaries
Yang Trista Cao
Anna Sotnikova
Jieyu Zhao
Linda X. Zou
Rachel Rudinger
Hal Daumé
PILM
101
11
0
12 Dec 2023
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia
A. Vezhnevets
J. Agapiou
Avia Aharon
Ron Ziv
Jayd Matyas
Edgar A. Duéñez-Guzmán
William A. Cunningham
Simon Osindero
Danny Karmon
Joel Z Leibo
LLMAG, LM&Ro, AI4CE
106
50
0
06 Dec 2023
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly
Yifan Yao
Jinhao Duan
Kaidi Xu
Yuanfang Cai
Eric Sun
Yue Zhang
PILM, ELM
125
561
0
04 Dec 2023
Developing Linguistic Patterns to Mitigate Inherent Human Bias in Offensive Language Detection
Toygar Tanyel
Besher Alkurdi
S. Ayvaz
40
0
0
04 Dec 2023
Personality of AI
Byunggu Yu
Junwhan Kim
37
1
0
03 Dec 2023
Evaluating Large Language Model Creativity from a Literary Perspective
Murray Shanahan
Catherine Clarke
50
8
0
30 Nov 2023
I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text
Kaan Efe Keles
Ömer Kaan Gürbüz
Mucahid Kutlu
WaLM
41
2
0
29 Nov 2023
SoUnD Framework: Analyzing (So)cial Representation in (Un)structured (D)ata
Mark Díaz
Sunipa Dev
Emily Reif
Remi Denton
Vinodkumar Prabhakaran
103
4
0
28 Nov 2023
Ethical Implications of ChatGPT in Higher Education: A Scoping Review
Ming Li
Ariunaa Enkhtur
Fei Cheng
B. Yamamoto
97
7
0
24 Nov 2023
Potential Societal Biases of ChatGPT in Higher Education: A Scoping Review
Ming Li
Ariunaa Enkhtur
B. Yamamoto
Fei Cheng
Lilan Chen
AI4CE
115
7
0
24 Nov 2023
The HaLLMark Effect: Supporting Provenance and Transparent Use of Large Language Models in Writing with Interactive Visualization
Md. Naimul Hoque
Tasfia Mashiat
Bhavya Ghai
Cecilia Shelton
Fanny Chevalier
Kari Kraus
Niklas Elmqvist
99
19
0
21 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
87
12
0
21 Nov 2023
IMGTB: A Framework for Machine-Generated Text Detection Benchmarking
Michal Spiegel
Dominik Macko
DeLMO, VLM
58
5
0
21 Nov 2023
Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination
Nakyeong Yang
Taegwan Kang
Stanley Jungkyu Choi
Honglak Lee
Kyomin Jung
71
12
0
16 Nov 2023
Simulating Opinion Dynamics with Networks of LLM-based Agents
Yun-Shiuan Chuang
Agam Goyal
Nikunj Harlalka
Siddharth Suresh
Robert Hawkins
Sijia Yang
Dhavan Shah
Junjie Hu
Timothy T. Rogers
AI4CE
121
73
0
16 Nov 2023
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers
Yuxia Wang
Revanth Gangi Reddy
Zain Muhammad Mujahid
Arnav Arora
Aleksandr Rubashevskii
...
Nadav Borenstein
Aditya Pillai
Isabelle Augenstein
Iryna Gurevych
Preslav Nakov
HILM
125
42
0
15 Nov 2023
AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications
Bhaktipriya Radharapu
Kevin Robinson
Lora Aroyo
Preethi Lahoti
106
41
0
14 Nov 2023
SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models
Bertie Vidgen
Nino Scherrer
Hannah Rose Kirk
Rebecca Qian
Anand Kannappan
Scott A. Hale
Paul Röttger
ALM, ELM
116
29
0
14 Nov 2023
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming
Suyu Ge
Chunting Zhou
Rui Hou
Madian Khabsa
Yi-Chia Wang
Qifan Wang
Jiawei Han
Yuning Mao
AAML, LRM
85
104
0
13 Nov 2023
Explicit Foundation Model Optimization with Self-Attentive Feed-Forward Neural Units
Jake Ryland Williams
Haoran Zhao
124
0
0
13 Nov 2023
Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks
Jake Ryland Williams
Haoran Zhao
117
0
0
13 Nov 2023
Understanding Users' Dissatisfaction with ChatGPT Responses: Types, Resolving Tactics, and the Effect of Knowledge Level
Yoonsu Kim
Jueon Lee
Seoyoung Kim
Jaehyuk Park
Juho Kim
130
41
0
13 Nov 2023
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
Yassir Fathullah
Chunyang Wu
Egor Lakomkin
Ke Li
Junteng Jia
Shangguan Yuan
Jay Mahadeokar
Ozlem Kalinli
Christian Fuegen
Michael Seltzer
LM&MA, MLLM, AuLLM
116
43
0
12 Nov 2023
Online Advertisements with LLMs: Opportunities and Challenges
Soheil Feizi
Mohammadtaghi Hajiaghayi
Keivan Rezaei
Suho Shin
OffRL
162
11
0
11 Nov 2023
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRM, HILM
142
935
0
09 Nov 2023
Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications
Fengqing Jiang
Zhangchen Xu
Luyao Niu
Wei Ping
Jinyuan Jia
Bo Li
Radha Poovendran
AAML
113
36
0
07 Nov 2023
Benefits and Harms of Large Language Models in Digital Mental Health
Munmun De Choudhury
Sachin R. Pendse
Neha Kumar
LM&MA, AI4MH
86
47
0
07 Nov 2023
Uncovering Intermediate Variables in Transformers using Circuit Probing
Michael A. Lepori
Thomas Serre
Ellie Pavlick
161
7
0
07 Nov 2023
Quantifying Uncertainty in Natural Language Explanations of Large Language Models
Sree Harsha Tanneru
Chirag Agarwal
Himabindu Lakkaraju
LRM
68
15
0
06 Nov 2023
A Simple yet Efficient Ensemble Approach for AI-generated Text Detection
Harika Abburi
Kalyani Roy
Michael Suesserman
Nirmala Pudota
Balaji Veeramani
Edward Bowen
Sanmitra Bhattacharya
DeLMO
92
10
0
06 Nov 2023
FinGPT: Large Generative Models for a Small Language
Risto Luukkonen
Ville Komulainen
Jouni Luoma
Anni Eskelinen
Jenna Kanerva
...
Mikko Merioksa
Jyrki Heinonen
Aija Vahtola
Samuel Antao
S. Pyysalo
LM&MA
62
49
0
03 Nov 2023
Contextual Confidence and Generative AI
Shrey Jain
Zoe Hitzig
Pamela Mishkin
112
4
0
02 Nov 2023