MIRAGE-GenAI-2025 Dataset

MIRAGE-GenAI-2025 is a human-generated dataset encompassing the traffic of 3 Popular Generative AI Mobile Apps (ChatGPT, Copilot, and Gemini) and labeled with both the app and the specific generative activity performed.

DOWNLOAD THE PUBLIC MIRAGE-GenAI-2025 DATASET
slider image

Download
MIRAGE-GenAI-2025 New

Download the latest downloadable release

Download Icon

MIRAGE-GenAI-2025 includes the traffic generated by human experimenters using 3 popular Generative AI chatbots via their mobile apps: ChatGPT, Microsoft Copilot, and Google Gemini.

The dataset is structured in two distinct parts: a generic dataset generated via unconstrained prompts and a controlled dataset generated using a fixed prompt set.

Each app was used to perform two specific activities: Textual (T) and Multimodal (M) content generation. The latter primarily encompasses image generation accompanied by brief textual descriptions.

The dataset is released in JSON format, making available the raw traffic data captured.

APP LIST reports the details on the apps and related activities contained in the downloadable version of the dataset.


Creative Commons License
MIRAGE-GenAI-2025 dataset is released under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

MIRAGE-GenAI-2025 Paper

If you are using MIRAGE-GenAI-2025 human-generated dataset for scientific papers, academic lectures, project reports, or technical documents, please help us increasing its impact by citing the following reference:

Antonio Montieri, Alfredo Nascita, Antonio Pescapè,
From Prompts to Packets: A View from the Network on ChatGPT, Copilot, and Gemini, Computer Networks, 2026, 112237, ISSN 1389-1286, https://doi.org/10.1016/j.comnet.2026.112237.