
SimpleText@CLEF-2022

SimpleText is a new lab organised as part of the CLEF-2022 conference, initiated by the CLEF Initiative.

SimpleText@CLEF-2022 Tasks


SimpleText Task Guidelines

We invite you to submit both automatic and manual runs! Manual intervention should be reported.



Task 3: Rewrite this! Given a query, simplify passages from scientific abstracts.

The goal of this task is to provide a simplified version of text passages (sentences) with regard to a query. Participants will be provided with queries and abstracts of scientific papers. The abstracts can be split into sentences. The simplified passages will be evaluated manually, possibly with the use of aggregate metrics.

Input format: The train and test data are provided in JSON and CSV formats with the following fields:

Input example:

{"snt_id":"G11.1_2892036907_2","source_snt":"With the ever increasing number of unmanned aerial vehicles getting involved in activities in the civilian and commercial domain, there is an increased need for autonomy in these systems too.","doc_id":2892036907,"query_id":"G11.1","query_text":"drones"}
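
As a minimal sketch, a record like the one above can be read with Python's standard json module. This assumes the data files use plain ASCII double quotes; the variable names are illustrative and not part of the task definition.

```python
import json

# One input record, copied from the example above (with plain ASCII quotes).
raw_record = (
    '{"snt_id":"G11.1_2892036907_2",'
    '"source_snt":"With the ever increasing number of unmanned aerial vehicles '
    'getting involved in activities in the civilian and commercial domain, '
    'there is an increased need for autonomy in these systems too.",'
    '"doc_id":2892036907,"query_id":"G11.1","query_text":"drones"}'
)

record = json.loads(raw_record)

# The sentence to simplify and the query it relates to.
print(record["query_text"])
print(record["source_snt"])

# In this example doc_id is a JSON number, while the other identifiers are strings.
print(type(record["doc_id"]).__name__, type(record["snt_id"]).__name__)
```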

Output format: Simplified passages in JSON format or in a tab-separated (TSV) file (for manual runs) with the following fields:

Output example:

{"run_id":"BTU_task_3_run1","manual":1,"snt_id":"G11.1_2892036907_2","simplified_snt":"Drones are increasingly used in the civilian and commercial domain and need to be autonomous."}
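
A record with these four fields could be produced as sketched below. The helper name make_output_record is hypothetical, and the default manual=0 for automatic runs is an assumption: the example only shows the value 1.

```python
import json


def make_output_record(run_id, snt_id, simplified_snt, manual=0):
    """Build one output record with the four fields shown in the example."""
    return {
        "run_id": run_id,
        "manual": manual,  # 1 in the example; 0 for automatic runs is an assumption
        "snt_id": snt_id,  # copied unchanged from the input record
        "simplified_snt": simplified_snt,
    }


record = make_output_record(
    run_id="BTU_task_3_run1",
    snt_id="G11.1_2892036907_2",
    simplified_snt=(
        "Drones are increasingly used in the civilian and commercial domain "
        "and need to be autonomous."
    ),
    manual=1,
)

# ensure_ascii=False keeps any non-ASCII characters readable in the output file.
line = json.dumps(record, ensure_ascii=False)
print(line)
```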

Output format checker

You can use this Python 3 script to check the output format. The script requires Python 3 and the pandas library: Download Python output checker
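
For a quick local sanity check before running the official script, something along the following lines may help. This is not the official checker and may test different things: the field names come from the output example, while the expected types and the function name check_records are assumptions.

```python
# Field names taken from the output example; the expected types are an assumption.
REQUIRED_FIELDS = {
    "run_id": str,
    "manual": int,
    "snt_id": str,
    "simplified_snt": str,
}


def check_records(records):
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    for i, rec in enumerate(records):
        for field, expected_type in REQUIRED_FIELDS.items():
            if field not in rec:
                problems.append(f"record {i}: missing field '{field}'")
            elif not isinstance(rec[field], expected_type):
                problems.append(
                    f"record {i}: field '{field}' should be {expected_type.__name__}"
                )
    return problems


good = [{
    "run_id": "BTU_task_3_run1",
    "manual": 1,
    "snt_id": "G11.1_2892036907_2",
    "simplified_snt": "Drones are increasingly used in the civilian and "
                      "commercial domain and need to be autonomous.",
}]
bad = [{"run_id": "BTU_task_3_run1", "manual": "yes"}]

print(check_records(good))
for problem in check_records(bad):
    print(problem)
```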

Disclaimer: By downloading and using these data, you agree to the terms of use. Any use of the data for any purpose other than academic research would be in violation of the intended use of these data.

Therefore, by downloading and using these data you give the following assurances with respect to the SimpleText data:

  1. You will not use nor permit others to use the data in the SimpleText datasets in any way except for classes and academic research.
  2. You will not at any time disclose, give, or transmit (in any manner or form or for any purpose) the data (or any portion thereof) to any location or person, including but not limited to making the data available on the Internet and copying the data onto any cloud-based storage system.
  3. You will not release nor permit others to release the dataset or any part of it to any person.

In case of violation of the conditions for access to the data for scientific purposes, this access may be withdrawn from the research entity and/or from the researcher. The research entity may also be liable to pay compensation for damages to third parties or be asked to take disciplinary action against the offending researcher.

Evaluation

The simplified passages will be evaluated manually, possibly with the use of aggregate metrics.

Result submission:

Participants should put their run results into the folder Documents created for their user and submit them by email to contact@simpletext-project.com.

The email subject has to be in the format [CLEF TASK 3] TEAM_ID.

Runs should be submitted as a ZIP archive of the corresponding JSON files. Manual runs may be submitted in CSV format.
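
Packaging the run files and formatting the email subject could be scripted roughly as follows. This is a sketch using only the Python standard library; the team ID "BTU", the file names, and the helper names are hypothetical, and the archive layout (files stored flat, without directories) is an assumption.

```python
import tempfile
import zipfile
from pathlib import Path


def email_subject(team_id):
    """Format the subject line as required: [CLEF TASK 3] TEAM_ID."""
    return f"[CLEF TASK 3] {team_id}"


def zip_runs(run_files, archive_path):
    """Pack the given run files into one ZIP archive, stored flat by file name."""
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for run_file in run_files:
            run_file = Path(run_file)
            archive.write(run_file, arcname=run_file.name)
    return Path(archive_path)


# Demonstration in a temporary directory with a placeholder run file.
with tempfile.TemporaryDirectory() as tmp:
    run_path = Path(tmp) / "BTU_task_3_run1.json"
    run_path.write_text("[]", encoding="utf-8")  # placeholder content only
    archive_path = zip_runs([run_path], Path(tmp) / "BTU_task_3.zip")
    with zipfile.ZipFile(archive_path) as archive:
        names = archive.namelist()

print(names)
print(email_subject("BTU"))
```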

A confirmation email will be sent within 2 days after the submission deadline.

How to Cite

If you extend or use this work, please cite the paper where it was introduced:

Liana Ermakova, Eric SanJuan, Jaap Kamps, Stéphane Huet, Irina Ovchinnikova, Diana Nurbakova, 
Sílvia Araújo, Radia Hannachi, Elise Mathurin, and Patrice Bellot. 2022. 
Overview of the CLEF 2022 SimpleText Lab: Automatic Simplification of Scientific Texts. 
In Experimental IR Meets Multilinguality, Multimodality, and Interaction: 13th International 
Conference of the CLEF Association, CLEF 2022, Bologna, Italy, September 5–8, 2022, Proceedings. 
Springer-Verlag, Berlin, Heidelberg, 470–494. https://doi.org/10.1007/978-3-031-13643-6_28

Paper

Download .BIB