generated from HugoBlox/theme-research-group
-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
73 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,73 @@ | ||
--- | ||
# Documentation: https://wowchemy.com/docs/managing-content/ | ||
|
||
title: "Seminar: \"Prudent NLG Evaluation with Humans\"" | ||
# event: | ||
# event_url: | ||
location: Abacws | ||
# address: | ||
# street: | ||
# city: | ||
# region: | ||
# postcode: | ||
# country: | ||
summary: Talk by [Vilém Zouhar](https://vilda.net/) (ETH Zürich, Switzerland) | ||
abstract: "Annually, research teams spend large amounts of money to evaluate the quality of NLG systems (WMT for machine translation, inter alia). We'll first look at how to speed up and improve the quality of the annotators' work by pre-filling annotations with automatic quality estimation ([ESA](https://aclanthology.org/2024.wmt-1.131/), [ESAᴬᴵ](https://arxiv.org/abs/2406.12419)). In the second part, we'll take the automatization a step further and try to determine which segments do not need to be evaluated at all. For this, we make use of methods from psychometrics for efficient yet informative testset construction for human students. In our case, the students to be tested are NLG systems." | ||
|
||
# Talk start and end times. | ||
# End time can optionally be hidden by prefixing the line with `#`. | ||
date: 2025-01-16T13:00:00Z | ||
date_end: 2025-01-16T14:00:00Z | ||
all_day: false | ||
|
||
# Schedule page publish date (NOT event date). | ||
publishDate: 2025-01-13T00:00:00Z | ||
|
||
authors: [alvamanchegof] | ||
tags: [] | ||
|
||
# Is this a featured event? (true/false) | ||
featured: false | ||
|
||
# Featured image | ||
# To use, add an image named `featured.jpg/png` to your page's folder. | ||
# Focal points: Smart, Center, TopLeft, Top, TopRight, Left, Right, BottomLeft, Bottom, BottomRight. | ||
image: | ||
caption: "" | ||
focal_point: "" | ||
preview_only: false | ||
|
||
# Custom links (optional). | ||
# Uncomment and edit lines below to show custom links. | ||
# links: | ||
# - name: Follow | ||
# url: https://twitter.com | ||
# icon_pack: fab | ||
# icon: twitter | ||
|
||
# Optional filename of your slides within your event's folder or a URL. | ||
url_slides: | ||
|
||
url_code: | ||
url_pdf: | ||
url_video: | ||
|
||
# Markdown Slides (optional). | ||
# Associate this event with Markdown slides. | ||
# Simply enter your slide deck's filename without extension. | ||
# E.g. `slides = "example-slides"` references `content/slides/example-slides.md`. | ||
# Otherwise, set `slides = ""`. | ||
slides: "" | ||
|
||
# Projects (optional). | ||
# Associate this post with one or more of your projects. | ||
# Simply enter your project's folder or file name without extension. | ||
# E.g. `projects = ["internal-project"]` references `content/project/deep-learning/index.md`. | ||
# Otherwise, set `projects = []`. | ||
projects: [] | ||
--- | ||
|
||
**Invited Speaker:** [Vilém Zouhar](https://vilda.net/) (ETH Zürich, Switzerland) | ||
|
||
**Bio:** | ||
Vilém is a PhD student at ETH Zürich working on both human and automatic evaluation of MT/NLG systems, balancing costs, quality, and bias. |