
[MODULE] A module on quantization #169

Draft
michaelshekasta wants to merge 13 commits into main
Conversation

@michaelshekasta michaelshekasta commented Jan 12, 2025

I’d like to propose a new module aimed at optimizing language models for efficient CPU-based inference, reducing reliance on GPUs. The module covers three key areas: quantization techniques, the GGUF model format, and utilizing Intel and MLX accelerators for optimized inference.

What do you think?
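As a sketch of the kind of technique the quantization section would cover, here is minimal symmetric (absmax) int8 quantization in pure Python. This is an illustrative toy, not the module's actual notebook code; real quantizers such as llama.cpp work block-wise on tensors with per-block scales.

```python
# Minimal sketch of symmetric (absmax) int8 quantization.
# Pure Python for clarity; real quantizers operate on tensors block-wise.

def quantize_absmax(weights):
    """Map floats onto int8 [-127, 127] using the max absolute value as scale."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate floats from the int8 values and the stored scale."""
    return [qi * scale for qi in q]

weights = [0.12, -0.5, 0.33, 0.99, -0.77]
q, scale = quantize_absmax(weights)
restored = dequantize(q, scale)
# Each restored value is within half a quantization step of the original.
```

The round trip loses at most `scale / 2` per weight, which is the core accuracy-vs-size trade-off the module would explain.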

@michaelshekasta michaelshekasta marked this pull request as draft January 12, 2025 15:03
@burtenshaw
Collaborator

Hi @michaelshekasta . Sorry to go quiet on this. I've been wrapped up on an agents course for HF learn this week. I will review it tomorrow.

@michaelshekasta
Author

@burtenshaw a gentle reminder

@burtenshaw burtenshaw changed the title Draft!! Quantization [MODULE] A module on quantization Jan 16, 2025
@burtenshaw
Collaborator

@michaelshekasta This is a great start. I've implemented a more typical structure. I would suggest that you now move on to the next stage:

  • find references for each section of the module.
  • add them to the references section of the markdown pages.
  • add bullet-point notes to each section of the page covering the key topics.
  • highlight sections that you don't understand or need help with.

Once you're ready, I'll review and complete the module's prose.

@burtenshaw burtenshaw left a comment


I would suggest moving on to the notebooks and implementing two very simple ones where you:

  1. use llama.cpp
  2. use a CPU inference framework of your choice

I will take a pass at the existing prose in fundamentals and update that.
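As a sketch of how the GGUF notebook might open, here is a stdlib-only reader for the GGUF file header. The field layout is assumed from the GGUF spec (little-endian: 4-byte magic `b"GGUF"`, uint32 version, uint64 tensor count, uint64 metadata key/value count) and should be verified against the current spec before use; the demo builds a fake header so no real model file is needed.

```python
import struct

def read_gguf_header(buf):
    """Parse the assumed GGUF header layout from a bytes buffer."""
    magic, version, n_tensors, n_kv = struct.unpack_from("<4sIQQ", buf, 0)
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return {"version": version, "tensors": n_tensors, "metadata_kv": n_kv}

# Build a fake 24-byte header to demonstrate (counts are arbitrary).
fake = struct.pack("<4sIQQ", b"GGUF", 3, 291, 24)
header = read_gguf_header(fake)
```

In the notebook, the same call on the first bytes of a real `.gguf` file would let learners confirm the model's format version before loading it with llama.cpp.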

Need to look for more resources

## Exercise Notebooks
I'm unsure about what exactly we should include here. Below are a few options, along with my humble thoughts:

If you look at the other modules you'll see a table with example notebooks. In this module we will need two: one on GGUF and one on CPU inference.

| Title | Description | Exercise | Link | Colab |
|-------|-------------|----------|------|-------|
| Quantization with LlamaCPP | Description| Exercise| [link](./notebooks/example.ipynb) | <a target="_blank" href="link"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a> |
| CPU Inference (Intel or MLX) | Description| Exercise| [link](./notebooks/example.ipynb) | <a target="_blank" href="link"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a> |

This table is sufficient. You can remove the mentions of exercise notebooks in the sub-pages and replace them with links.
