K-Viscuit 🍪: Multi-Choice VQA Dataset for Korean Culture

This repository presents the K-Viscuit 🍪 dataset, a Multi-Choice Visual Question Answering (VQA) dataset designed to evaluate Vision-Language Models (VLMs) on Korean culture. This dataset is part of the research presented in our paper: Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024 June. The dataset was created through a Human-VLM collaboration, and examples of the data are as follows.

Dataset Availability

The dataset is available both in this repository and HuggingFace Datasets.

Quickstart

To evaluate the llava-hf/llava-v1.6-mistral-7b-hf model on our dataset, please refer to the run_vqa.py script provided in this repository.

BibTex

For more details about our dataset, please refer to our paper!

@article{baek2024evaluating,
  title={Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration},
  author={Baek, Yujin and Park, ChaeHun and Kim, Jaeseok and Heo, Yu-Jung and Chang, Du-Seong and Choo, Jaegul},
  journal={arXiv preprint arXiv:2406.16469},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
dataset		dataset
README.md		README.md
examples.png		examples.png
run_vqa.py		run_vqa.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

K-Viscuit 🍪: Multi-Choice VQA Dataset for Korean Culture

Dataset Availability

Quickstart

BibTex

About

Releases

Packages

Languages

ddehun/k-viscuit

Folders and files

Latest commit

History

Repository files navigation

K-Viscuit 🍪: Multi-Choice VQA Dataset for Korean Culture

Dataset Availability

Quickstart

BibTex

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages