Feature Request: Adding support for Ternary DiT models #10334

Open
4 tasks done
Lucky-Lance opened this issue Nov 16, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@Lucky-Lance

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

llama.cpp already supports ternary quantization for LLMs, e.g., BitNet b1.58. We have trained a ternary diffusion transformer model, TerDiT. Because our engineering abilities are limited, I would like to ask whether llama.cpp could support deploying this model. This would help our research a lot.

Motivation

Ternary quantization has become popular and has delivered computational speedups and power reductions, as shown by projects like llama.cpp and bitnet.cpp. We trained the first ternary DiT network; DiT is currently a popular architecture for text-to-image generation. We would like to know whether we could get help deploying it with llama.cpp.

Possible Implementation

We have limited engineering abilities. The implementation of TerDiT is similar to LLaMA, so we think the existing implementations in llama.cpp and bitnet.cpp could be helpful 😊.
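
For context, here is a minimal sketch of the absmean ternary quantization scheme described for BitNet b1.58, which maps each weight to -1, 0, or +1 with one shared scale. It is illustrative only: the function names are hypothetical, and it is neither the TerDiT training code nor the packed tensor layout used by ggml.

```python
def ternary_quantize(weights, eps=1e-8):
    """Quantize a list of floats to {-1, 0, +1} plus one shared scale.

    Assumption: absmean scaling, i.e. the scale is the mean absolute weight.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    # Divide by the scale, round to the nearest integer, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / (scale + eps)))) for w in weights]
    return quantized, scale


def ternary_dequantize(quantized, scale):
    """Reconstruct approximate weights from ternary values and the scale."""
    return [q * scale for q in quantized]


weights = [0.9, -0.05, -1.2, 0.4, 0.0, -0.7]
q, s = ternary_quantize(weights)
approx = ternary_dequantize(q, s)
print(q)  # [1, 0, -1, 1, 0, -1]
```

Each weight then needs fewer than two bits of storage, and matrix multiplication reduces to additions and subtractions followed by one multiplication by the scale.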

@Lucky-Lance Lucky-Lance added the enhancement New feature or request label Nov 16, 2024
@Lucky-Lance Lucky-Lance changed the title from "Feature Request:" to "Feature Request: Adding support for Ternary DiT models" Nov 16, 2024
@slaren
Collaborator

slaren commented Nov 16, 2024

llama.cpp does not support image generation models, but I would suggest taking a look at https://github.com/leejet/stable-diffusion.cpp which is built using the same ggml library and could use the same ternary tensor types that are available in llama.cpp.
