Refactor IO config to yaml in assets #2311

xiaoyu-work · 2026-01-16T19:50:34Z

This pull request introduces a major refactor of the HuggingFace ONNX I/O configuration system in Olive. The changes move away from hardcoded Python class registries and instead adopt a YAML-driven approach for specifying input/output configurations for both tasks and diffusers pipelines. This enables easier extensibility and clearer, data-driven configuration management. Additionally, the update adds default dummy input shapes and rich configuration templates for tasks and diffusers components.

The most important changes are:

YAML-based IO Configuration System:

Added new YAML files (defaults.yaml, tasks.yaml, diffusers.yaml) in olive/assets/io_configs/ that define default dummy input shapes, task templates, and diffusers component/pipeline specifications for ONNX export. These files replace the previous Python-based registries and provide a more flexible and maintainable configuration system. [1] [2] [3]

Refactor of IO Config Python Interface:

Replaced the contents of olive/common/hf/io_config/__init__.py to remove all hardcoded class registries and mapping logic, and instead import new YAML-driven utility functions (generate_dummy_inputs, get_io_config, get_diffusers_io_config, etc.). This greatly simplifies the interface and shifts responsibility to the new YAML configuration system.

Legal and Organizational Improvements:

Added copyright and license headers to olive/assets/__init__.py and olive/assets/io_configs/__init__.py.## Describe your changes

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

olive/common/hf/io_config/input_generators.py

olive/common/hf/io_config/task_config.py

Copilot

Pull request overview

This pull request introduces a major refactor of the HuggingFace ONNX I/O configuration system, migrating from hardcoded Python class registries to a flexible YAML-driven approach. The changes enable easier extensibility and clearer configuration management for model optimization workflows.

Changes:

Replaced Python-based OnnxConfig class registries with YAML configuration files for tasks and diffusers components
Introduced unified dummy input generators that read specifications from YAML rather than class hierarchies
Consolidated I/O configuration generation into a single task-driven API with automatic optional input filtering based on model signatures

Reviewed changes

Copilot reviewed 29 out of 34 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
olive/assets/io_configs/tasks.yaml	Defines input/output specifications for 15+ HuggingFace tasks
olive/assets/io_configs/diffusers.yaml	Defines I/O specs for diffusers components (SD, SDXL, SD3, Flux, Sana)
olive/assets/io_configs/defaults.yaml	Provides default dimension values and attribute aliases for config resolution
olive/common/hf/io_config/yaml_loader.py	YAML loading and config resolution utilities with LRU caching
olive/common/hf/io_config/task_config.py	New unified API for generating I/O configs and dummy inputs
olive/common/hf/io_config/input_generators.py	Refactored to use YAML-driven input generation
olive/common/hf/model_io.py	Updated to use new task-driven API
olive/common/hf/wrapper.py	Migrated to use resolve_alias for config attribute access
test/common/hf/io_config/*.py	Updated tests for new YAML-driven system

Comments suppressed due to low confidence (1)

olive/common/hf/io_config/task_config.py:1

Missing check for whether 'with_past' key exists in task_spec before accessing it. This will raise a KeyError if a task doesn't define with_past overrides. Add a check: if self.use_past_in_inputs and 'with_past' in self.task_spec:

# -------------------------------------------------------------------------

olive/common/hf/io_config/input_generators.py

test/common/test_hf.py

olive/common/hf/io_config/input_generators.py

olive/assets/io_configs/defaults.yaml

olive/common/hf/io_config/io_resolver.py

olive/common/hf/io_config/task_config.py

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

docs/source/how-to/extending/how-to-add-new-task.md

olive/assets/io_configs/defaults.yaml

olive/common/hf/io_config/io_resolver.py

shaahji · 2026-01-21T18:49:16Z

Suggestion: Current implementation requires user to clone the git repo and modify it. This approach has challenges, most importantly dev vs. released version. Instead, rather than piling all the configuration in a single file, have one file hold configuration for one task. This will allow you to accept new task configuration on command line without explicitly modifying cloned repo. This approach also allows working with released version - which is always recommended compared to using a git clone.

P.S. That's the approach both lm-eval and bb implements.

xiaoyu-work · 2026-01-21T22:28:32Z

Suggestion: Current implementation requires user to clone the git repo and modify it. This approach has challenges, most importantly dev vs. released version. Instead, rather than piling all the configuration in a single file, have one file hold configuration for one task. This will allow you to accept new task configuration on command line without explicitly modifying cloned repo. This approach also allows working with released version - which is always recommended compared to using a git clone.

P.S. That's the approach both lm-eval and bb implements.

We now have 22 tasks, having separate config files for each task makes maintenance difficult. Most use cases would be adding new task IO rather than modifying existing ones. How about keeping the current configuration files, but also allowing override with external config files?

shaahji · 2026-01-21T22:47:59Z

Suggestion: Current implementation requires user to clone the git repo and modify it. This approach has challenges, most importantly dev vs. released version. Instead, rather than piling all the configuration in a single file, have one file hold configuration for one task. This will allow you to accept new task configuration on command line without explicitly modifying cloned repo. This approach also allows working with released version - which is always recommended compared to using a git clone.
P.S. That's the approach both lm-eval and bb implements.

We now have 22 tasks, having separate config files for each task makes maintenance difficult. Most use cases would be adding new task IO rather than modifying existing ones. How about keeping the current configuration files, but also allowing override with external config files?

How often do we expect to change any of these ones they are committed? For all intents and purposes, this is readonly data, unlikely to change ever. But if it makes it simpler, all for it.

On the implementation front, you could basically assume any number of tasks per file. So, there shouldn't be any difference whether you are loading the Olive released or the user provided one. Basically, the code path should remain the same.

xiaoyu-work · 2026-01-21T23:12:23Z

Suggestion: Current implementation requires user to clone the git repo and modify it. This approach has challenges, most importantly dev vs. released version. Instead, rather than piling all the configuration in a single file, have one file hold configuration for one task. This will allow you to accept new task configuration on command line without explicitly modifying cloned repo. This approach also allows working with released version - which is always recommended compared to using a git clone.
P.S. That's the approach both lm-eval and bb implements.

We now have 22 tasks, having separate config files for each task makes maintenance difficult. Most use cases would be adding new task IO rather than modifying existing ones. How about keeping the current configuration files, but also allowing override with external config files?

How often do we expect to change any of these ones they are committed? For all intents and purposes, this is readonly data, unlikely to change ever. But if it makes it simpler, all for it.

On the implementation front, you could basically assume any number of tasks per file. So, there shouldn't be any difference whether you are loading the Olive released or the user provided one. Basically, the code path should remain the same.

i think in real life, it is rare that user wants to provide his own io config for his own task. if a task is not supported by Olive, then we should add support it.

shaahji · 2026-01-23T18:52:15Z

i think in real life, it is rare that user wants to provide his own io config for his own task. if a task is not supported by Olive, then we should add support it.

Fair enough argument :) Let's just focus on what's easiest to maintain and extend.

setup.py

ooo

xiaoyu-work changed the title ~~Xiaoyu/io~~ Refactor IO config to assets Jan 16, 2026

xiaoyu-work changed the title ~~Refactor IO config to assets~~ Refactor IO config to yaml in assets Jan 16, 2026

github-advanced-security bot found potential problems Jan 16, 2026

View reviewed changes

olive/common/hf/io_config/input_generators.py Fixed Show fixed Hide fixed

olive/common/hf/io_config/input_generators.py Fixed Show fixed Hide fixed

olive/common/hf/io_config/task_config.py Fixed Show fixed Hide fixed

xiaoyu-work requested a review from Copilot January 16, 2026 21:34

Copilot started reviewing on behalf of xiaoyu-work January 16, 2026 21:35 View session

Copilot AI reviewed Jan 16, 2026

View reviewed changes

Refactor IO config to yaml in assets

51c3a9a

xiaoyu-work force-pushed the xiaoyu/io branch from b5aeb2d to 51c3a9a Compare January 20, 2026 19:18

github-advanced-security bot found potential problems Jan 20, 2026

View reviewed changes

olive/common/hf/io_config/task_config.py Fixed Show fixed Hide fixed

github-advanced-security bot found potential problems Jan 20, 2026

View reviewed changes

olive/common/hf/io_config/task_config.py Fixed Show fixed Hide fixed

xiaoyu-work and others added 3 commits January 20, 2026 12:38

Potential fix for code scanning alert no. 19325: Empty except

42c7394

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

fix format

be5d84e

add documentation

9c2359f

shaahji previously requested changes Jan 21, 2026

View reviewed changes

address comments

6cc10ce

fix test

597b627

xiaoyu-work marked this pull request as ready for review January 26, 2026 18:38

jambayk reviewed Jan 26, 2026

View reviewed changes

setup.py Show resolved Hide resolved

Merge branch 'main' into xiaoyu/io

5265047

jambayk approved these changes Jan 26, 2026

View reviewed changes

xiaoyu-work merged commit ca37f23 into main Jan 27, 2026
11 checks passed

xiaoyu-work deleted the xiaoyu/io branch January 27, 2026 00:04

Refactor IO config to yaml in assets #2311

Refactor IO config to yaml in assets #2311

Uh oh!

Conversation

xiaoyu-work commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shaahji commented Jan 21, 2026

Uh oh!

xiaoyu-work commented Jan 21, 2026

Uh oh!

shaahji commented Jan 21, 2026

Uh oh!

xiaoyu-work commented Jan 21, 2026

Uh oh!

shaahji commented Jan 23, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

xiaoyu-work commented Jan 16, 2026 •

edited

Loading