VisIT-Bench Leaderboard

To submit your results to the leaderboard, you can run our auto-evaluation code, following the instructions here. Once you are happy with the results, you can send to this mail. Please include in your email 1) a name for your model, 2) your team name (including your affiliation), and optionally, 3) a github repo or paper link. Please also attach your predictions: you can add a "predictions" column to this csv.

Category
Model
Elo
# Matches
Win vs. Reference (w/ # ratings)

Single Image

Human Verified Reference

1349
6480
65.44% (n=136)