Skip to content
Change the repository type filter

All

    Repositories list

    • axs

      Public
      KRAI X workflow automation system
      Python
      MIT License
      2401Updated Nov 18, 2024Nov 18, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.6k000Updated Nov 15, 2024Nov 15, 2024
    • Automated KRAI X workflows for reproducing MLPerf Inference submissions
      Python
      MIT License
      11111Updated Nov 13, 2024Nov 13, 2024
    • kilt4uai

      Public
      A plugin for KILT (KRAI Inference Library Technology) for integration with Untether AI's imAIgine SDK
      C++
      MIT License
      0000Updated Aug 6, 2024Aug 6, 2024
    • axs2uai

      Public
      Automated KRAI X workflows for reproducing MLPerf Inference submissions on systems with Untether AI's speedAI at-memory compute inference accelerators
      Python
      MIT License
      0000Updated Aug 6, 2024Aug 6, 2024
    • Automated KRAI X workflows for reproducing MLPerf Inference submissions on systems withAutomated KRAI X workflows for reproducing MLPerf Inference submissions with KILT ONNXRT CPU/GPU backends
      MIT License
      0000Updated Jul 26, 2024Jul 26, 2024
    • axs2kilt

      Public
      Automated KRAI X workflows for reproducing MLPerf Inference submissions powered by KRAI Inference Library Technology (KILT)
      Python
      MIT License
      1100Updated Jul 26, 2024Jul 26, 2024
    • Automated KRAI X workflows for reproducing MLPerf Inference submissions
      Python
      MIT License
      0000Updated Jul 26, 2024Jul 26, 2024
    • KILT (KRAI Inference Library Technology) - proudly powering some of the fastest and most energy efficient submissions in the history of MLPerf Inference
      C++
      MIT License
      1100Updated Jul 26, 2024Jul 26, 2024
    • inference

      Public
      Reference implementations of inference benchmarks
      Python
      Apache License 2.0
      536000Updated Jun 21, 2024Jun 21, 2024
    • policies

      Public
      General policies for MLPerf™ including submission rules, coding standards, etc.
      Python
      Apache License 2.0
      54000Updated Jun 11, 2024Jun 11, 2024
    • axs2qaic

      Public
      Automated KRAI X workflows for reproducing MLPerf Inference submissions on systems equipped with Qualcomm Cloud AI 100 accelerators
      Python
      MIT License
      0000Updated Apr 10, 2024Apr 10, 2024
    • Building Docker images for reproducing MLPerf Inference submissions with Qualcomm Cloud AI 100 accelerators
      Shell
      Other
      0000Updated Apr 10, 2024Apr 10, 2024
    • Issues related to MLPerf™ Inference policies, including rules and suggested changes
      Apache License 2.0
      52000Updated Apr 2, 2024Apr 2, 2024
    • axs2gcp

      Public
      Automated KRAI X workflows for Google Cloud Platform
      Python
      MIT License
      0300Updated Mar 14, 2024Mar 14, 2024
    • Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low latency across Computer Vision, Object Detection, Natural Language Processing and Generative AI models.
      Jupyter Notebook
      Other
      7000Updated Mar 12, 2024Mar 12, 2024
    • MIT License
      0000Updated Jan 29, 2024Jan 29, 2024
    • power-dev

      Public
      Dev repo for power measurement for the MLPerf benchmarks
      Python
      Apache License 2.0
      24000Updated Jan 26, 2024Jan 26, 2024
    • LLM_Wiki

      Public
      This is just a place for us to put whatever we’ve learnt about LLMs, be it papers, blog posts or our own experiences.
      0100Updated Nov 29, 2023Nov 29, 2023
    • TextToJSONConverter
      HTML
      0000Updated Oct 25, 2023Oct 25, 2023
    • TextLineConverter for NLP
      HTML
      0000Updated Oct 25, 2023Oct 25, 2023
    • This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.
      Apache License 2.0
      13000Updated Oct 10, 2023Oct 10, 2023
    • Prune a model while finetuning or training.
      Jupyter Notebook
      Apache License 2.0
      58000Updated Sep 26, 2023Sep 26, 2023
    • axs2snpe

      Public
      MIT License
      0000Updated Sep 5, 2023Sep 5, 2023
    • Python
      MIT License
      0000Updated Aug 18, 2023Aug 18, 2023
    • ck-mlperf

      Public archive
      Automated workflows for MLPerf, the industry-leading benchmark for evaluating performance of ML software and hardware
      Jupyter Notebook
      BSD 3-Clause "New" or "Revised" License
      23400Updated Aug 1, 2023Aug 1, 2023
    • Python
      0010Updated Jun 9, 2023Jun 9, 2023
    • ck-qaic

      Public archive
      Qualcomm Cloud AI (QAIC) implementation of MLPerf Inference benchmarks
      C++
      5800Updated Jun 7, 2023Jun 7, 2023
    • ck-armnn

      Public archive
      Collective Knowledge workflows for ArmNN
      Shell
      BSD 3-Clause "New" or "Revised" License
      10110Updated Apr 3, 2023Apr 3, 2023
    • axs repository for KILT
      Python
      MIT License
      0100Updated Mar 7, 2023Mar 7, 2023