
Ep Context Model generated with external data is still dependent on the same data file #23358

Open
BoarQing opened this issue Jan 14, 2025 · 2 comments
Labels: ep:VitisAI (issues related to Vitis AI execution provider)

Comments

@BoarQing
Contributor

Describe the issue

We found that when the EP context model is generated from a model with external data, any op that is not in the provider's domain still reads from the original data file. This is undesirable because the external data file is quite large.

Could MSFT trim the external data for the new EP context model so that it contains only those ops' weights?
Or could MSFT embed those weights directly into the new EP context model?

To reproduce

  1. Generate an EP context model from a model with external data.
  2. Delete the external data file.
  3. Run inference with the EP context model; it crashes because the external data cannot be found.

Urgency

It is important for the MSFT release.

Platform

Windows

OS Version

Windows 11

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

main

ONNX Runtime API

C++

Architecture

X64

Execution Provider

Vitis AI

Execution Provider Library Version

No response

@github-actions github-actions bot added the ep:VitisAI issues related to Vitis AI execution provider label Jan 14, 2025
@BoarQing
Contributor Author

@snnn @jywu-msft @HectorSVC

@BoarQing
Contributor Author

#23374 is a temporary solution that moves all the tensors into the .onnx file.
