Track 1: Event-Enriched Image Captioning
EVENTA Grand Challenge - ACM Multimedia 2025

To participate in the EVENTA 2025 Grand Challenge, please first register by submitting the form.

Overview

This track aims to generate captions that provide richer, more comprehensive information about an image. These captions go beyond simple visual descriptions by offering deeper insights, including the names and attributes of objects, the timing, context, outcomes of events, and other crucial details—information that cannot be gleaned from merely observing the image. Given an image, participants are required to search relevant articles in a provided external database and extract necessary information to enrich the image caption. This retrieval augmentation generation track facilitates the creation of more coherent and detailed narratives, capturing not only the visible elements but also the underlying context and significance of the scene, ultimately offering a more complete understanding of what the image represents.

Submission Instructions

Participants must submit a CSV file named using the following format: TeamName_EVENTA2025_Track1.csv. This file must be compressed into a ZIP archive named submission.zip before uploading to CodaLab.

The CSV file should include predictions for all images in the query set. It must contain 12 columns, separated by ; (semicolons), with the following structure:

  • Column 1: Query image ID
  • Columns 2–11: Top-10 retrieved article IDs, listed in descending order of relevance (from top-1 to top-10). If an article cannot be retrieved, use # as a placeholder.
  • Column 12: The generated caption corresponding to the query image

CSV Row Format Template:

<query_id>;<article_id_1>;<article_id_2>;...;<article_id_10>;<generated_caption>
<query_id>;<article_id_1>;#;...;#;<generated_caption>

There is no requirement to sort the rows by query ID—this will be handled automatically during evaluation.

We provide a submission example:

12312;56712;56723;56734;56745;56756;56767;56778;56789;56790;56701;A group of children playing soccer on a sunny afternoon.
12334;56712;#;#;#;#;#;#;#;#;#;A man riding a bicycle through a busy city street.
12345;56712;56723;56734;56745;56756;#;#;#;#;#;A cat sitting on a windowsill looking outside at the rain.

Participants also require to submit a detailed paper through the official challenge platform to validate their solutions.

The platform will be made available in the coming days. We kindly ask for your patience in the meantime.

Evaluation
The details will be made available in the coming days.
Leaderboard
TBA