Skip to content

Official repository for "Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024.06" (https://arxiv.org/pdf/2406.16469)

Notifications You must be signed in to change notification settings

ddehun/k-viscuit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

K-Viscuit 🍪: Multi-Choice VQA Dataset for Korean Culture

This repository presents the K-Viscuit 🍪 dataset, a Multi-Choice Visual Question Answering (VQA) dataset designed to evaluate Vision-Language Models (VLMs) on Korean culture. This dataset is part of the research presented in our paper: Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024 June. The dataset was created through a Human-VLM collaboration, and examples of the data are as follows.

Selected examples of K-Viscuit dataset

Dataset Availability

The dataset is available both in this repository and HuggingFace Datasets.

Quickstart

To evaluate the llava-hf/llava-v1.6-mistral-7b-hf model on our dataset, please refer to the run_vqa.py script provided in this repository.

BibTex

For more details about our dataset, please refer to our paper!

@article{baek2024evaluating,
  title={Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration},
  author={Baek, Yujin and Park, ChaeHun and Kim, Jaeseok and Heo, Yu-Jung and Chang, Du-Seong and Choo, Jaegul},
  journal={arXiv preprint arXiv:2406.16469},
  year={2024}
}

About

Official repository for "Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024.06" (https://arxiv.org/pdf/2406.16469)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages