ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.06439
  4. Cited By
AEON: A Method for Automatic Evaluation of NLP Test Cases

AEON: A Method for Automatic Evaluation of NLP Test Cases

13 May 2022
Jen-tse Huang
Jianping Zhang
Wenxuan Wang
Pinjia He
Yuxin Su
Michael R. Lyu
ArXivPDFHTML

Papers citing "AEON: A Method for Automatic Evaluation of NLP Test Cases"

7 / 7 papers shown
Title
The Earth is Flat? Unveiling Factual Errors in Large Language Models
The Earth is Flat? Unveiling Factual Errors in Large Language Models
Wenxuan Wang
Juluan Shi
Zhaopeng Tu
Youliang Yuan
Jen-tse Huang
Wenxiang Jiao
Michael R. Lyu
KELM
HILM
SyDa
47
1
0
01 Jan 2024
Machine Translation Testing via Syntactic Tree Pruning
Machine Translation Testing via Syntactic Tree Pruning
Quanjun Zhang
Juan Zhai
Chunrong Fang
Jiawei Liu
Dongrui Liu
Haichuan Hu
Qingyu Wang
23
3
0
01 Jan 2024
Towards Reasonable Budget Allocation in Untargeted Graph Structure
  Attacks via Gradient Debias
Towards Reasonable Budget Allocation in Untargeted Graph Structure Attacks via Gradient Debias
Zihan Liu
Yun Luo
Lirong Wu
Zicheng Liu
Stan Z. Li
AAML
22
25
0
29 Mar 2023
Transferable Adversarial Attacks on Vision Transformers with Token
  Gradient Regularization
Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization
Jianping Zhang
Yizhan Huang
Weibin Wu
Michael R. Lyu
AAML
ViT
18
49
0
28 Mar 2023
Robust Encodings: A Framework for Combating Adversarial Typos
Robust Encodings: A Framework for Combating Adversarial Typos
Erik Jones
Robin Jia
Aditi Raghunathan
Percy Liang
AAML
134
102
0
04 May 2020
Certified Robustness to Adversarial Word Substitutions
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
183
291
0
03 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
1