Dear Maik,
I’m doing some post-analysis and have a question about one of the datasets on TIRA.
I have a run result here:
- Dataset: pan25-generative-ai-detection-20260508-test
- Run: 2026-05-22-19-42-33 (submission alternating-contingency)
As far as I recall, the PAN’25 test set was never released publicly (it stays blinded on TIRA), so I want to confirm what this dataset actually is.
Is pan25-generative-ai-detection-20260508-test exactly the same data that was used as the official PAN’25 test set — i.e., the identical documents and ground-truth labels, with
no resampling, additions, re-obfuscation, or relabeling?
The reason I ask: I’d like to know whether my score on this dataset is directly comparable to the published PAN’25 leaderboard results. If the composition differs in any way
from the 2025 official test set, the comparison wouldn’t be apples-to-apples.
Thanks very much,
– Yurii