Skip to content
View javafa's full-sized avatar
๐Ÿ’ญ
I may be slow to respond.
๐Ÿ’ญ
I may be slow to respond.
  • Seoul, Korea
  • 10:02 (UTC +09:00)

Block or report javafa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
javafa/README.md

Turboquant Implementation

Baseline(FP16) ๋Œ€๋น„ ๋น„๊ต:

๊ตฌ์„ฑ VRAM KV Cache ์†๋„ (short) ์†๋„ (long)
BnB 4-bit -53.5% ๋™์ผ -28.6% -37.3%
TurboQuant 3-bit ๋™์ผ -39.9% +23.1% +5.3%
BnB 4-bit + TurboQuant -53.5% -39.9% -29.4% -38.5%
Unsloth 4-bit -42% ๋™์ผ -41.1% -17.5%
Unsloth 4-bit + TurboQuant -42% -39.9% -5.5% -12.7%

Ranked 1st on the huggingface Open Llm Leaderboard

  • This is a merged model. So you have to unckeck the "[X] Contains merge/moerge" checkbox above the list.
Metric Value
Avg. 81.28

free-evo-qwen72b-v0.8

Pinned Loading

  1. turboquant turboquant Public

    Google Research์˜ TurboQuant (arXiv:2504.19874) ๋…ผ๋ฌธ์„ PyTorch๋กœ ๊ตฌํ˜„ํ•œ ํ”„๋กœ์ ํŠธ์ž…๋‹ˆ๋‹ค.

    Python

  2. pwbs-paper pwbs-paper Public

    A structured prompting framework that applies Work Breakdown Structure (WBS) principles to LLM prompt engineering.

    TeX

  3. faceid faceid Public

    Lightweight Face Recognition

    Python 1

  4. thisiskotlin thisiskotlin Public

    Author of 'This is Kotlin' - ์ด๊ฒƒ์ด ์•ˆ๋“œ๋กœ์ด๋“œ๋‹ค with ์ฝ”ํ‹€๋ฆฐ - Arctic Fox

    Kotlin 50 46

  5. AndroidMathView AndroidMathView Public

    Katex TextView for android

    Kotlin 4

  6. SpringFCMSender SpringFCMSender Public

    Spring ์„ ์ด์šฉํ•ด์„œ FCM์„œ๋ฒ„ ๋ฉ”์‹œ์ง€ ์ „์†ก์„ ๊ตฌํ˜„ํ•ฉ๋‹ˆ๋‹ค

    Java 1