Prerequisites
- [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the Discussions, and have a new and useful enhancement to share.
Feature Description
llama.cpp already supports ternary quantization for LLMs, e.g., BitNet b1.58. We have trained a ternary diffusion transformer model, TerDiT. Due to the limits of our engineering ability, I am wondering whether llama.cpp could support deploying this model; it would help our research a lot.
Motivation
Ternary quantization has become popular and has demonstrated computational speedups and power reductions, as shown in projects like llama.cpp and bitnet.cpp. We have trained the first ternary DiT network; DiT is a popular architecture today for text-to-image generation. We would like to know whether we could get help deploying it through llama.cpp.
Possible Implementation
Our engineering ability is limited. TerDiT's architecture is similar to LLaMA's, so we think the existing implementations in llama.cpp and bitnet.cpp could serve as useful references 😊. A minimal sketch of the conversion step we imagine is below.
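As a concrete starting point, here is a minimal sketch of what the weight-conversion side could look like, assuming the TerDiT weights can be exported as float rows and that ggml's ternary types (the TQ1_0/TQ2_0 types added for BitNet b1.58 in llama.cpp) are the target. All names and sizes are illustrative; reading the actual TerDiT checkpoint is omitted.

```c
// Sketch: quantizing a float weight matrix into ggml's ternary TQ2_0 type.
// Illustrative only; a real converter would read rows from the trained
// TerDiT checkpoint instead of synthesizing them.
#include <stdio.h>
#include <stdlib.h>
#include "ggml.h"

int main(void) {
    const int64_t n_per_row = 512; // must be a multiple of the block size (256)
    const int64_t nrows     = 4;

    // Fake ternary-ish weights in float, values in {-1, 0, +1}.
    float * src = malloc(sizeof(float) * n_per_row * nrows);
    for (int64_t i = 0; i < n_per_row * nrows; i++) {
        src[i] = (float)((i % 3) - 1);
    }

    // Allocate the destination buffer using the quantized row size.
    const size_t row_sz = ggml_row_size(GGML_TYPE_TQ2_0, n_per_row);
    void * dst = malloc(row_sz * nrows);

    // Quantize all rows; the last argument (importance matrix) is optional.
    const size_t written = ggml_quantize_chunk(
        GGML_TYPE_TQ2_0, src, dst, /*start=*/0, nrows, n_per_row, /*imatrix=*/NULL);

    printf("quantized %lld values into %zu bytes\n",
           (long long)(n_per_row * nrows), written);

    free(src);
    free(dst);
    return 0;
}
```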
llama.cpp does not support image generation models, but I would suggest taking a look at https://github.com/leejet/stable-diffusion.cpp, which is built using the same ggml library and could use the same ternary tensor types that are available in llama.cpp.
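For anyone picking this up, a rough sketch of how a ggml-based project such as stable-diffusion.cpp could run a matmul against a ternary weight tensor follows. This is an assumption-laden illustration against ggml's public API, not stable-diffusion.cpp's actual loading code; in practice the weight data would come from a quantized GGUF file.

```c
// Sketch: a ggml matmul with a ternary (TQ2_0) weight tensor, the same
// mechanism llama.cpp uses for BitNet b1.58. Illustrative only.
// Note: on newer ggml versions ggml_graph_compute_with_ctx is declared
// in ggml-cpu.h rather than ggml.h.
#include <stdio.h>
#include <string.h>
#include "ggml.h"

int main(void) {
    struct ggml_init_params params = {
        .mem_size   = 16*1024*1024,
        .mem_buffer = NULL,
        .no_alloc   = false,
    };
    struct ggml_context * ctx = ggml_init(params);

    const int64_t n_in = 256, n_out = 4;

    // Ternary weight [n_in, n_out]; zeroed here for determinism, but a real
    // model would load quantized blocks from a GGUF tensor.
    struct ggml_tensor * w = ggml_new_tensor_2d(ctx, GGML_TYPE_TQ2_0, n_in, n_out);
    memset(w->data, 0, ggml_nbytes(w));

    // f32 activation [n_in, 1].
    struct ggml_tensor * x = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, n_in, 1);
    memset(x->data, 0, ggml_nbytes(x));

    // y = w @ x  -> [n_out, 1]
    struct ggml_tensor * y = ggml_mul_mat(ctx, w, x);

    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, y);
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads=*/1);

    printf("output tensor has %lld elements\n", (long long)ggml_nelements(y));
    ggml_free(ctx);
    return 0;
}
```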