Skip to content
@AI-Hypercomputer

AI-Hypercomputer

Reference implementations, benchmarks, recipes, and all things Google Cloud AI Hypercomputer

Popular repositories Loading

  1. maxtext maxtext Public

    A simple, performant and scalable Jax LLM!

    Python 1.6k 326

  2. JetStream JetStream Public

    JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

    Python 288 36

  3. maxdiffusion maxdiffusion Public

    Python 192 23

  4. xpk xpk Public

    xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

    Python 106 32

  5. jetstream-pytorch jetstream-pytorch Public

    PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"

    Python 53 17

  6. gpu-recipes gpu-recipes Public

    Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

    Shell 41 7

Repositories

Showing 10 of 18 repositories
  • tpu-recipes Public
    AI-Hypercomputer/tpu-recipes’s past year of commit activity
    Shell 13 Apache-2.0 10 3 7 Updated Mar 6, 2025
  • maxtext Public

    A simple, performant and scalable Jax LLM!

    AI-Hypercomputer/maxtext’s past year of commit activity
    Python 1,639 Apache-2.0 326 35 (2 issues need help) 150 Updated Mar 6, 2025
  • kithara Public
    AI-Hypercomputer/kithara’s past year of commit activity
    Python 4 Apache-2.0 2 0 2 Updated Mar 6, 2025
  • gpu-recipes Public

    Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

    AI-Hypercomputer/gpu-recipes’s past year of commit activity
    Shell 41 Apache-2.0 7 1 0 Updated Mar 6, 2025
  • xpk Public

    xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerators such as TPUs and GPUs on GKE.

    AI-Hypercomputer/xpk’s past year of commit activity
    Python 106 Apache-2.0 32 14 28 Updated Mar 6, 2025
  • JetStream Public

    JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

    AI-Hypercomputer/JetStream’s past year of commit activity
    Python 288 Apache-2.0 36 11 7 Updated Mar 6, 2025
  • ray-tpu Public
    AI-Hypercomputer/ray-tpu’s past year of commit activity
    Python 5 Apache-2.0 3 2 0 Updated Mar 5, 2025
  • maxdiffusion Public
    AI-Hypercomputer/maxdiffusion’s past year of commit activity
    Python 192 Apache-2.0 23 4 (1 issue needs help) 8 Updated Mar 5, 2025
  • AI-Hypercomputer/accelerator-microbenchmarks’s past year of commit activity
    Python 0 Apache-2.0 1 0 0 Updated Mar 4, 2025
  • torchprime Public

    TorchPrime is a reference model implementation for PyTorch on TPU/GPU.

    AI-Hypercomputer/torchprime’s past year of commit activity
    Python 11 1 45 (4 issues need help) 6 Updated Mar 4, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.