Lucio_Challenge
Total Prize Pool: $7000
15 March
Lucio HQ
10:00 AM – 8:00 PM
Merch for shortlisted candidates
200 Document Corpus. 15 Questions. 30 Seconds.
Build the fastest document ingestion pipeline
Ingest and structure a fixed corpus of 200 documents for efficient downstream reasoning.
Answer complex questions against ground truth
Accurately answer 15 predefined questions, evaluated against known correct answers from the full corpus.
Meet strict performance targets
Deliver end-to-end results in under 30 seconds, balancing latency, accuracy, and robustness.
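One way to keep the 30-second target honest is to time the whole run, ingestion included, against a single clock. The sketch below is a minimal illustration, not the official harness: `run_with_budget`, `ingest_one`, and `answer_one` are hypothetical names, and the thread pool size is an arbitrary assumption.

```python
import time
from concurrent.futures import ThreadPoolExecutor

BUDGET_SECONDS = 30.0  # end-to-end target stated in the challenge


def run_with_budget(documents, questions, ingest_one, answer_one,
                    budget=BUDGET_SECONDS, workers=8):
    """Ingest documents in parallel, then answer questions until time runs out.

    ingest_one and answer_one are hypothetical callables supplied by the
    participant; they stand in for whatever parsing and reasoning is used.
    """
    start = time.monotonic()

    # Ingestion is usually I/O heavy, so a thread pool is a reasonable first try.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        index = list(pool.map(ingest_one, documents))  # keeps input order

    answers = {}
    for question in questions:
        if time.monotonic() - start >= budget:
            break  # out of time: remaining questions stay unanswered
        answers[question] = answer_one(index, question)

    elapsed = time.monotonic() - start
    return answers, elapsed
```

Measuring with `time.monotonic()` avoids surprises from system clock adjustments, and returning `elapsed` alongside the answers makes it easy to log latency and accuracy together.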
Talk is cheap. Benchmarks aren’t.
If you can do this in under 30s, you win
$5000
The fastest solution wins
$1000
The most innovative architecture wins
$1000
Use this corpus to understand:
The document types you need to ingest
The formats you’ll encounter
The scale and structure of the data
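A quick survey of the test corpus answers all three points before any pipeline code is written. The sketch below assumes the corpus has been downloaded to a local folder; `survey_corpus` is a hypothetical helper, and reading the 50 MB limit as 50 MiB is an assumption.

```python
from collections import Counter
from pathlib import Path

SIZE_LIMIT_BYTES = 50 * 1024 * 1024  # assumption: "50 MB" read as 50 MiB


def survey_corpus(root):
    """Count files per extension and list any file at or over the size limit."""
    formats = Counter()
    oversized = []
    total_bytes = 0
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        size = path.stat().st_size
        formats[path.suffix.lower() or "(none)"] += 1
        total_bytes += size
        if size >= SIZE_LIMIT_BYTES:
            oversized.append(path.name)
    return {
        "formats": dict(formats),
        "total_bytes": total_bytes,
        "oversized": oversized,
    }
```

The extension counts show which parsers are needed, and the total size gives a rough idea of how much I/O has to fit inside the time budget.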
What’s guaranteed:
Each document will be under 50 MB
Only formats present in the test corpus will appear in the final dataset
Ground-truth data is provided to evaluate correctness.
Your system will be evaluated on whether it presents accurate answers, grounded in the source documents.
Questions
Answers
Document Name & Page Numbers of the final answer
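The three fields above map naturally onto one record per question. The sketch below is an assumption about how a local self-check could look: the page does not state the actual scoring rule, so the normalized exact match and the "at least one shared page" test are placeholders, and `AnswerRecord` and `check` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AnswerRecord:
    question: str
    answer: str
    document_name: str
    pages: tuple  # page numbers supporting the answer


def normalize(text):
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(text.lower().split())


def check(predicted, truth):
    """Return (answer_matches, citation_matches) for one question.

    Assumption: exact match after normalization, and a citation counts
    if the document name matches and at least one page overlaps.
    """
    answer_ok = normalize(predicted.answer) == normalize(truth.answer)
    citation_ok = (
        predicted.document_name == truth.document_name
        and bool(set(predicted.pages) & set(truth.pages))
    )
    return answer_ok, citation_ok
```

Keeping the answer check and the citation check separate makes it clear whether a miss comes from the reasoning step or from retrieval.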
Use whatever infrastructure works best for you. You can run your solution on your own machine, lambdas, or wherever else you want.
You’re welcome to use any third-party tools or services, provided you have the necessary licenses/authorisations/permissions to use them.
Just note that you’ll cover the cost of running your solution.
Your solution will need to connect to a publicly hosted Lucio server to fetch the final corpus and submit your answers.
For any clarifications, reach out to challenge@lucioai.com
Strong problem-solving and execution mindset required
Phase 1: Online Submission (March 1)
We release:
Test dataset
Problem statement
Constraints & evaluation criteria
Teams build and submit solutions via our website
Phase 2: Shortlisting (March 3)
Our team reviews submissions
Selected teams are invited to our office
Phase 3: On-site Sprint & Presentations (March 15)
Final sprint & refinement
Presentations + judging
Winner announcement
System architecture
Time taken to execute
{Important_Dates}
6 Feb
1 Mar
3 Mar
15 Mar
{Why_Participate?}
Work on a real, non-trivial problem
Push the boundaries of high-stakes document intelligence
Win exciting prizes
By registering, participants consent to photography and videography during the event.
All participants must adhere to a code of conduct ensuring respect, fairness, and collaboration.
By participating, you agree that your solution will be published in the public domain to support open learning and community research.
{Powered_By}
Partner
Our mission is to advance AI engineering through public research and open collaboration. All solutions will be made public for the community to build on.

