wip: add HW requirements calculator #1216

Draft · wants to merge 1 commit into main
Conversation

ngxson (Member) commented on Feb 21, 2025

To be discussed with @Vaibhavs10

Demo:

mem {
  "name": "hexgrad/Kokoro-82M",
  "memory": {
    "minimumGigabytes": 1.991212226,
    "recommendedGigabytes": 2.1903334486
  }
}
mem {
  "name": "microsoft/OmniParser-v2.0",
  "memory": {
    "minimumGigabytes": 1.664,
    "recommendedGigabytes": 1.8304
  }
}
mem {
  "name": "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",
  "memory": {
    "minimumGigabytes": 5.4170575452,
    "recommendedGigabytes": 5.95876329972
  }
}
mem {
  "name": "deepseek-ai/DeepSeek-R1-Distill-Llama-8B",
  "memory": {
    "minimumGigabytes": 20.424667624799998,
    "recommendedGigabytes": 22.467134387279998
  }
}
mem {
  "name": "NousResearch/DeepHermes-3-Llama-3-8B-Preview",
  "memory": {
    "minimumGigabytes": 20.4246676512,
    "recommendedGigabytes": 22.46713441632
  }
}
mem {
  "name": "unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit",
  "memory": {
    "minimumGigabytes": 8.309023701600001,
    "recommendedGigabytes": 9.139926071760001
  }
}

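One detail visible in these numbers: recommendedGigabytes is consistently minimumGigabytes × 1.1, i.e. a flat 10% headroom on top of the estimated minimum. A minimal TypeScript sketch of the output shape and that margin (hypothetical type and function names, not necessarily the PR's actual ones):

```ts
// Hypothetical sketch of the output shape shown above; names are
// illustrative, not taken from the PR.
interface HardwareRequirements {
  name: string;
  memory: {
    minimumGigabytes: number;
    recommendedGigabytes: number;
  };
}

// In the demo output, recommendedGigabytes is always minimumGigabytes * 1.1,
// i.e. a flat 10% safety margin on top of the estimated minimum.
function withHeadroom(name: string, minimumGigabytes: number): HardwareRequirements {
  return {
    name,
    memory: {
      minimumGigabytes,
      recommendedGigabytes: minimumGigabytes * 1.1,
    },
  };
}

console.log("mem", withHeadroom("hexgrad/Kokoro-82M", 1.991212226));
// -> recommendedGigabytes ≈ 2.19
```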
Vaibhavs10 (Member) left a comment

Nice! I like this. Some quick questions:

  1. How do we fetch the user's hardware from their Local Apps page?
  2. Should we maybe start with just LLMs?

Quite excited to make this work!

cc: @julien-c for thoughts too

ngxson (Member, Author) commented on Feb 27, 2025

  • How do we fetch the user's hardware from their Local Apps page?

On moon-landing, the data is exposed via FrontData.ts, so adding a check for whether a model is compatible with a given user's hardware should be simple.

The idea is that getHardwareRequirements provides a reference for how much RAM is needed; it doesn't need any notion of the user's hardware. Then, on the moon-landing frontend, we implement this check against the user's hardware (see the sketch after this comment).

  • Should we maybe start with just LLMs?

Yes, absolutely agree!

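A minimal sketch of what that moon-landing-side check could look like (all names here are hypothetical; the real FrontData.ts shape may differ):

```ts
// Hypothetical frontend-side compatibility check; names are illustrative,
// not the actual FrontData.ts / moon-landing API.
interface UserHardware {
  totalMemoryGigabytes: number; // RAM or VRAM reported on the Local Apps page
}

interface HardwareRequirements {
  name: string;
  memory: { minimumGigabytes: number; recommendedGigabytes: number };
}

type Compatibility = "incompatible" | "tight" | "comfortable";

function checkCompatibility(req: HardwareRequirements, hw: UserHardware): Compatibility {
  if (hw.totalMemoryGigabytes < req.memory.minimumGigabytes) return "incompatible";
  if (hw.totalMemoryGigabytes < req.memory.recommendedGigabytes) return "tight";
  return "comfortable";
}
```

The library stays hardware-agnostic; only this thin frontend layer needs to know what the user actually has.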
julien-c (Member) left a comment

So, we would focus on only RAM/VRAM in the beginning, correct? I agree with this approach, but I would maybe start with just GGUF, and later extend to other formats.

/**
 * The context size in tokens. Defaults to 2048.
 */
contextSize?: number;
Member
If possible, contextSize should be taken from tasks.ModelData, so it's clearer that the data comes from normalized parsing of HF model repos.

Member
And tbh I'm wondering whether this kind of method should live in the tasks or gguf modules rather than here.

Member Author

IMO contextSize should be a number that most users will actually use in practice, for example 2048 or 4096.

Setting it to the model's max contextSize may not be useful, as most users never have enough VRAM to run the full context length anyway (especially since 128k contexts are more and more common nowadays). See the sketch below for a sense of scale.

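For a sense of scale, here's a back-of-the-envelope KV-cache estimate using the standard formula (a generic sketch; the config numbers assume a Llama-3-8B-style architecture with GQA and are not taken from this PR):

```ts
// KV cache bytes = 2 (K and V) * layers * contextSize * kvHeads * headDim * bytesPerElement
function kvCacheGigabytes(
  layers: number,
  kvHeads: number,
  headDim: number,
  contextSize: number,
  bytesPerElement = 2 // f16
): number {
  return (2 * layers * contextSize * kvHeads * headDim * bytesPerElement) / 1e9;
}

// Llama-3-8B-style config: 32 layers, 8 KV heads (GQA), head dim 128.
kvCacheGigabytes(32, 8, 128, 2048);   // ≈ 0.27 GB at a 2048-token context
kvCacheGigabytes(32, 8, 128, 131072); // ≈ 17.2 GB at the full 128k context
```

At full context, the KV cache alone would dwarf the weights for many long-context models, which is why a practical default matters.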
Member

OK, I see your point. cc @gary149 wdyt?
Maybe it's then another selector in the UI, with a user-set context size?

Member

We could set an educated default of 8k though (Claude was 8k-only for a long time).
