[Performance] Multithreading for DequantizeLinear #23395

tarekziade · 2025-01-16T11:44:15Z

Describe the issue

The current DequantizeLinear CPU operator does not use threads.

I have implemented a quick prototype that shows a 4x speedup on that operator when used with a Qwen 2.5 0.5B model.
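To make the idea concrete, here is a minimal sketch of per-tensor dequantization split across threads. It is not the prototype mentioned above and not ONNX Runtime code: `DequantizeRange` and `DequantizeParallel` are hypothetical names, and plain `std::thread` with `uint8` input is assumed purely for illustration. It relies only on the ONNX definition `y = (x - zero_point) * scale`, which is elementwise, so disjoint ranges can be processed independently.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <vector>

// Hypothetical helper (not an ONNX Runtime API): dequantize the elements in
// [begin, end) with the ONNX formula y = (x - zero_point) * scale.
void DequantizeRange(const std::uint8_t* input, float* output,
                     std::size_t begin, std::size_t end,
                     float scale, std::uint8_t zero_point) {
  for (std::size_t i = begin; i < end; ++i) {
    const std::int32_t centered = static_cast<std::int32_t>(input[i]) -
                                  static_cast<std::int32_t>(zero_point);
    output[i] = static_cast<float>(centered) * scale;
  }
}

// Hypothetical driver: splits the tensor into contiguous chunks, one per
// thread. Each thread writes to a disjoint output range, so no locking is
// needed.
std::vector<float> DequantizeParallel(const std::vector<std::uint8_t>& input,
                                      float scale, std::uint8_t zero_point,
                                      std::size_t num_threads) {
  assert(num_threads > 0);
  const std::size_t n = input.size();
  std::vector<float> output(n);
  const std::size_t chunk = (n + num_threads - 1) / num_threads;  // ceil(n / threads)

  std::vector<std::thread> workers;
  for (std::size_t t = 0; t < num_threads; ++t) {
    const std::size_t begin = t * chunk;
    if (begin >= n) break;  // fewer elements than threads
    const std::size_t end = std::min(n, begin + chunk);
    workers.emplace_back(DequantizeRange, input.data(), output.data(),
                         begin, end, scale, zero_point);
  }
  for (auto& worker : workers) worker.join();
  return output;
}
```

A real change would presumably reuse the runtime's existing intra-op thread pool instead of spawning threads per call, and would also need to cover the per-axis and blocked quantization variants, which this sketch ignores.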

I do see a comment about this:

https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/providers/cpu/quantization/quantize_linear.cc#L302

@fajin-corp is this something you were planning to implement? I'd be happy to help under your guidance

To reproduce

n/a

Urgency

No response

Platform

Windows

OS Version

any

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

main

ONNX Runtime API

Python

Architecture

X64

Execution Provider

Default CPU

Execution Provider Library Version

No response

Model File

No response

Is this a quantized model?

Yes

yuslepukhin · 2025-01-16T17:08:35Z

Go ahead and PR it.

fajin-corp · 2025-01-16T18:50:11Z

@tarekziade I'm not working on it. You are very welcome to open a PR for it.

tarekziade added the "performance" label (issues related to performance regressions) on Jan 16, 2025

github-actions bot added the "quantization" label (issues related to quantization) on Jan 16, 2025
