1902.03570
Cited By
EvalAI: Towards Better Evaluation Systems for AI Agents
10 February 2019
Deshraj Yadav
Rishabh Jain
Harsh Agrawal
Prithvijit Chattopadhyay
Taranjeet Singh
Akash Jain
Shivkaran Singh
Stefan Lee
Dhruv Batra
ELM
Papers citing
"EvalAI: Towards Better Evaluation Systems for AI Agents"
11 / 11 papers shown
Title
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One
Michael Ranzinger
Greg Heinrich
Jan Kautz
Pavlo Molchanov
VLM
46
42
0
10 Dec 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
27
7
0
06 Jun 2023
Alexa Arena: A User-Centric Interactive Platform for Embodied AI
Qiaozi Gao
Govind Thattai
Suhaila Shakiah
Xiaofeng Gao
Shreyas Pansare
...
Michael Johnston
R. Ghanadan
Arindam Mandal
Dilek Z. Hakkani-Tür
Premkumar Natarajan
6
27
0
02 Mar 2023
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Leandro von Werra
Lewis Tunstall
A. Thakur
A. Luccioni
Tristan Thrush
...
Julien Chaumond
Margaret Mitchell
Alexander M. Rush
Thomas Wolf
Douwe Kiela
ELM
25
24
0
30 Sep 2022
ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Matt Deitke
Eli VanderBilt
Alvaro Herrasti
Luca Weihs
Jordi Salvador
...
Winson Han
Eric Kolve
Ali Farhadi
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
49
238
0
14 Jun 2022
Neural Latents Benchmark '21: Evaluating latent variable models of neural population activity
Felix Pei
Joel Ye
D. Zoltowski
Anqi Wu
Raeed H. Chowdhury
...
L. Miller
Jonathan W. Pillow
Il Memming Park
Eva L. Dyer
C. Pandarinath
55
87
0
09 Sep 2021
Accelerating Probabilistic Volumetric Mapping using Ray-Tracing Graphics Hardware
Heajung Min
K. Han
Young J. Kim
11
6
0
20 Nov 2020
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Shikib Mehri
Mihail Eric
Dilek Z. Hakkani-Tür
ELM
26
136
0
28 Sep 2020
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning
Weihao Yu
Zihang Jiang
Yanfei Dong
Jiashi Feng
LRM
25
245
0
11 Feb 2020
OpenEDS: Open Eye Dataset
Stephan J. Garbin
Yiru Shen
Immo Schuetz
Robert Cavin
Gregory Hughes
S. Talathi
MDE
16
70
0
30 Apr 2019
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
21
470
0
20 Dec 2018