| Date | Event |
|---|---|
| 01/22/2026 | Public launch • Call for participation • Open registration • Release starter kit |
| 02/09/2026 | Release 5 benchmark sets |
| 04/24/2026 | Track A and Track B submission deadline (11:59 PM PT) |
| 05/01/2026 | Writeup deadline (11:59 PM PT) |
| 05/11/2026 | Notify winners |
| 05/17/2026 - 05/22/2026 | MLSys 2026 - award ceremony |
Please see PROBLEM.md for the full problem statement.
We are hosting two distinct tracks to celebrate both systems engineering and AI reasoning. Teams may participate in one or both tracks.
Submissions will be accepted via the Official Submission Form. You will be asked to provide your team details and upload your solution as a single .zip archive.
- Deadline: April 24, 2026, 11:59 PM PT.
- File Naming: Please name your archive using the format `TeamName_Track_SubmissionNumber.zip` (e.g., `DeepRunners_TrackA_1.zip`).
- Form Settings Note: when uploading, ensure your file is a standard `.zip` archive.
This track focuses on the efficiency and robustness of the scheduling algorithm itself.
- Deliverable: A compiled binary executable named `mlsys`.
- Interface: Your binary must accept two command-line arguments: `$ ./mlsys <path_to_input.json> <path_to_output.json>`
  - Note: The timeout is enforced by the organizers' harness, not passed as an argument. Your binary simply needs to write the solution file before it is killed (see the local test-harness sketch after this list).
- Zip Contents:
  - `mlsys`: The statically linked binary (must run on Ubuntu 22.04 LTS).
  - `source/`: Complete source code (C++/Rust/etc.) for verification.
  - `writeup.pdf`: A 2-page summary of your approach.
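For local testing, here is a minimal sketch of a harness that mirrors the invocation and timeout behavior described above. The `benchmarks/` and `out/` directory layout is an assumption for illustration, and the script only checks that the output is valid JSON; the official harness and cost evaluation are run by the organizers.

```python
#!/usr/bin/env python3
"""Minimal local test harness for a Track A binary (illustrative sketch only).

Assumptions: the released benchmark inputs live in ./benchmarks/<name>.json,
outputs are written to ./out/, and the per-benchmark timeouts mirror the
table in this README. The official harness is run by the organizers.
"""
import json
import subprocess
import sys
from pathlib import Path

BINARY = "./mlsys"
# Released benchmarks and their Track A timeouts (seconds).
TIMEOUTS = {
    "mlsys-2026-1": 2,
    "mlsys-2026-5": 5,
    "mlsys-2026-9": 15,
    "mlsys-2026-13": 30,
    "mlsys-2026-17": 60,
}

def run_one(name: str, timeout_s: int) -> bool:
    inp = Path("benchmarks") / f"{name}.json"
    out = Path("out") / f"{name}.out.json"
    out.parent.mkdir(exist_ok=True)
    try:
        # Same interface as the official harness: input path, then output path.
        subprocess.run([BINARY, str(inp), str(out)], timeout=timeout_s, check=True)
    except subprocess.TimeoutExpired:
        # Mirrors the official behavior: the process is killed at the timeout,
        # but a solution written before the kill still counts.
        print(f"{name}: killed at {timeout_s}s, checking for a partial result")
    except subprocess.CalledProcessError as e:
        print(f"{name}: non-zero exit code {e.returncode}")
        return False
    # Basic sanity check only: the output must at least be valid JSON.
    try:
        json.loads(out.read_text())
    except (OSError, json.JSONDecodeError) as e:
        print(f"{name}: invalid or missing output ({e})")
        return False
    print(f"{name}: OK")
    return True

if __name__ == "__main__":
    results = [run_one(name, t) for name, t in TIMEOUTS.items()]
    sys.exit(0 if all(results) else 1)
```

For scoring, the evaluation methods in the starter kit's `mlsys.h` can be used to compute the cost of each generated schedule.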
This track benchmarks the ability of Gemini-powered Agents to solve scheduling puzzles. To ensure fair comparison, your agent must exclusively utilize the Google Gemini API for its reasoning and generation.
- Deliverable: A Python script named `agent.py`.
- Interface: Your script will be executed as follows: `$ python3 agent.py <path_to_input.json> <path_to_output.json>`
- Environment:
  - The organizers will inject the `GOOGLE_API_KEY` environment variable.
  - Network access is strictly blocked except to the Gemini API endpoint (`generativelanguage.googleapis.com`). Calls to any other LLM provider (OpenAI, Anthropic, etc.) will fail.
  - Timeout: Your agent script is strictly limited to 10 minutes of execution time per benchmark (including all API calls and local processing).
- Zip Contents:
  - `agent.py`: The entry point script (a minimal skeleton follows this list).
  - `requirements.txt`: A list of Python dependencies (e.g., `google-generativeai`, `numpy`).
  - `prompts/`: A folder containing your system prompts, few-shot examples, or agent definitions (required for reproducibility).
  - `writeup.pdf`: A short (1-2 page) summary of your prompting strategy and agent architecture.
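As a starting point, here is a minimal, illustrative `agent.py` skeleton that follows the interface and environment rules above. It assumes the `google-generativeai` package is listed in `requirements.txt`; the prompt and output parsing are placeholders, and the actual input/output JSON schemas are defined in PROBLEM.md.

```python
#!/usr/bin/env python3
"""Minimal Track B agent skeleton (illustrative sketch, not a reference solution).

Assumptions: google-generativeai is installed via requirements.txt, and the
input/output JSON schemas follow PROBLEM.md. The prompt below is a placeholder;
a real agent would validate and iteratively repair its schedules.
"""
import json
import os
import sys

import google.generativeai as genai

def main(input_path: str, output_path: str) -> None:
    # The organizers inject GOOGLE_API_KEY into the environment.
    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
    model = genai.GenerativeModel("gemini-1.5-flash")

    with open(input_path) as f:
        problem = json.load(f)

    # Placeholder prompt: a real agent would include the schema, constraints,
    # and few-shot examples loaded from the prompts/ folder.
    prompt = (
        "You are a scheduling assistant. Produce a valid solution JSON for "
        "the following problem instance. Reply with JSON only.\n\n"
        + json.dumps(problem)
    )
    response = model.generate_content(prompt)

    # Naive parsing: assumes the model replied with a bare JSON object.
    solution = json.loads(response.text)

    with open(output_path, "w") as f:
        json.dump(solution, f)

if __name__ == "__main__":
    main(sys.argv[1], sys.argv[2])
```

A competitive agent would typically check the returned schedule locally and iterate within the 10-minute budget rather than trusting a single response.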
Both tracks share the same set of 25 benchmark problems.
To ensure fair play and broad accessibility, this contest is designed to be Zero Cost for all participants.
For Track A (Systems):
- Standard Hardware: Since the problem is a specialized graph scheduling puzzle (JSON-to-JSON), no special hardware is required for the solution itself. A standard laptop or a free Colab CPU instance is sufficient to run the `mlsys` binary and generate schedules.
For Track B (Agents):
- Free Intelligence: Participants are encouraged to use Gemini 1.5 Flash via Google AI Studio. The free tier provides approximately 1,500 requests per day, which is sufficient for developing and testing robust agentic schedulers.
To help you get started, we have released a separate GitHub repository containing several C++ files that define the basic problem and solution classes, along with various utilities. You are under no obligation to use this starter code.
| File | Description |
|---|---|
| mlsys.h | Defines the Problem and Solution classes, along with methods for file I/O and solution evaluation. |
Please don't hesitate to reach out to the Contest Organizers if you encounter any trouble getting these materials to work.
The contest will involve twenty-five industrial benchmarks; five have been released in advance to contest participants (for testing purposes), and the remaining twenty will be withheld for evaluation:
Note on Timeouts: The timeouts listed below apply strictly to Track A (Systems). For Track B (Agents), a fixed timeout of 10 minutes is applied to every benchmark to accommodate LLM inference latency.
| Benchmark Name | # Nodes | # Edges | Timeout | Will be released to participants? |
|---|---|---|---|---|
| mlsys-2026-1 | 5 | 9 | 2 seconds | Yes |
| mlsys-2026-2 | TBD | TBD | 2 seconds | |
| mlsys-2026-3 | TBD | TBD | 2 seconds | |
| mlsys-2026-4 | TBD | TBD | 2 seconds | |
| mlsys-2026-5 | 19 | 34 | 5 seconds | Yes |
| mlsys-2026-6 | TBD | TBD | 5 seconds | |
| mlsys-2026-7 | TBD | TBD | 5 seconds | |
| mlsys-2026-8 | TBD | TBD | 5 seconds | |
| mlsys-2026-9 | 32 | 56 | 15 seconds | Yes |
| mlsys-2026-10 | TBD | TBD | 15 seconds | |
| mlsys-2026-11 | TBD | TBD | 15 seconds | |
| mlsys-2026-12 | TBD | TBD | 15 seconds | |
| mlsys-2026-13 | 63 | 126 | 30 seconds | Yes |
| mlsys-2026-14 | TBD | TBD | 30 seconds | |
| mlsys-2026-15 | TBD | TBD | 30 seconds | |
| mlsys-2026-16 | TBD | TBD | 30 seconds | |
| mlsys-2026-17 | 103 | 198 | 60 seconds | Yes |
| mlsys-2026-18 | TBD | TBD | 60 seconds | |
| mlsys-2026-19 | TBD | TBD | 60 seconds | |
| mlsys-2026-20 | TBD | TBD | 60 seconds | |
| mlsys-2026-21 | TBD | TBD | 120 seconds | |
| mlsys-2026-22 | TBD | TBD | 120 seconds | |
| mlsys-2026-23 | TBD | TBD | 120 seconds | |
| mlsys-2026-24 | TBD | TBD | 120 seconds | |
All benchmarks will become publicly available after the contest is complete.
The contest organizers will execute each team's binary across the twenty withheld benchmarks on a dedicated 8-core Linux workstation with 32GB of RAM. Rankings will be established by calculating the total number of points per team, where points are derived by normalizing cost values against the best submitted solution.
For example, assume the following raw costs:
| Team | Benchmark X | Benchmark Y | Benchmark Z |
|---|---|---|---|
| Team A | cost: 100 | cost: 500 | cost: 800 |
| Team B | cost: 300 | cost: 200 | cost: 900 |
| Team C | cost: 600 | cost: 700 | cost: 400 |
| Min. Team Cost | cost: 100 | cost: 200 | cost: 400 |
The points that a team earns on a particular benchmark are then equal to `min_team_cost / team_cost`:
| Team | Benchmark X | Benchmark Y | Benchmark Z | Total Points | Ranking |
|---|---|---|---|---|---|
| Team A | 1.000 points | 0.400 points | 0.500 points | 1.900 points | First Place 🥇 |
| Team B | 0.333 points | 1.000 points | 0.444 points | 1.778 points | Second Place 🥈 |
| Team C | 0.167 points | 0.286 points | 1.000 points | 1.452 points | Third Place 🥉 |
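For concreteness, here is a small sketch that reproduces the point totals in the table above from the example raw costs (the cost values are the illustrative ones from this example, not real benchmark results):

```python
# Reproduces the illustrative scoring example above:
# points(team, benchmark) = min_team_cost / team_cost, summed per team.
costs = {
    "Team A": {"X": 100, "Y": 500, "Z": 800},
    "Team B": {"X": 300, "Y": 200, "Z": 900},
    "Team C": {"X": 600, "Y": 700, "Z": 400},
}

benchmarks = ["X", "Y", "Z"]
min_cost = {b: min(c[b] for c in costs.values()) for b in benchmarks}

totals = {
    team: sum(min_cost[b] / c[b] for b in benchmarks)
    for team, c in costs.items()
}

# Higher totals rank first: Team A ~1.900, Team B ~1.778, Team C ~1.452.
for rank, (team, pts) in enumerate(sorted(totals.items(), key=lambda t: -t[1]), 1):
    print(f"{rank}. {team}: {pts:.3f} points")
```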
Thanks to our sponsor Google, we are offering a $50,000 Total Prize Pool, split evenly between the two tracks to encourage diversity in approaches.
| Track | Gold (1st) | Silver (2nd) | Bronze (3rd) | Honorable Mentions |
|---|---|---|---|---|
| Track A: Systems | $10,000 | $7,500 | $5,000 | $2,500 (shared) |
| Track B: Agents | $10,000 | $7,500 | $5,000 | $2,500 (shared) |
Award Rules:
- Open Source: To be eligible for cash prizes, winning teams must open-source their solution (code or agent prompts) under an MIT/Apache 2.0 license within 2 months of the award.
- One Prize Per Team: A single team can compete in both tracks but may only claim the highest monetary prize earned across tracks (the unclaimed prize passes to the next rank).
- Agent Reproducibility: For Track B, the organizers will re-run your agent 3 times. The median score will be used to mitigate non-deterministic LLM outputs.
Teams that win the gold awards will be invited to present a ~10-minute informal overview of their approach at a special session held during the conference. Presentations will be linked from the main MLSys competition website.
All are welcome to participate in the contest -- including teams from academia, industry, and elsewhere -- with the exception of the Contest Organizers and employees of the Contest Sponsor. Individuals are prohibited from participating in multiple teams. In order to be eligible for prizes, contestants must commit to releasing an open-source version of their implementation 1 month after winning (allowing teams sufficient time to write & submit papers detailing their approach).
To raise a question, please create an issue in this repository, or feel free to reach out to the contest organizers directly.
- ASPLOS 2025 contest track, a precursor to this problem.
- Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning Workloads (2021)
- Apollo: Automatic Partition-based Operator Fusion through Layer by Layer Optimization (2022)
- Optimus: An Operator Fusion Framework for Deep Neural Networks (2022)
- FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (2022)
- Operator Fusion in XLA: Analysis and Evaluation (2023)
- How Much Can We Gain From Tensor Kernel Fusion on GPUs? (2024)
Yarong Mu (ymu@google.com) & Weiren Yu (weirenyu@google.com)