    Repositories list

    • triton

      Public
      Development repository for the Triton language and compiler
      Python
      MIT License
      1.8k105949 Updated Feb 7, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      13k132198 Updated Feb 7, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      511404912 Updated Feb 7, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2411.1k24751 Updated Feb 7, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k153216 Updated Feb 7, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6906775 Updated Feb 7, 2025
    • jax

      Public
      Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
      Python
      Apache License 2.0
      2.9k19018 Updated Feb 7, 2025
    • A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high-performance computing environments
      C++
      MIT License
      406708 Updated Feb 7, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      4034.9k10415 Updated Feb 7, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9520735350 Updated Feb 7, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1850111 Updated Feb 7, 2025
    • Jupyter Notebook
      104710 Updated Feb 7, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1453392452 Updated Feb 7, 2025
    • rocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
      C++
      MIT License
      8312 Updated Feb 7, 2025
    • ONNX Runtime: cross-platform, high performance scoring engine for ML models
      C++
      MIT License
      3k6010 Updated Feb 7, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48211242 Updated Feb 7, 2025
    • rocSHMEM

      Public
      rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
      C++
      MIT License
      124885 Updated Feb 7, 2025
    • hipSOLVER

      Public
      ROCm SOLVER marshalling library
      C++
      MIT License
      262404 Updated Feb 7, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2216044 Updated Feb 7, 2025
    • A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
      Python
      Other
      2.3k3017 Updated Feb 7, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      171382 Updated Feb 7, 2025
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5493.9k1944 Updated Feb 7, 2025
    • vision

      Public
      Datasets, Transforms and Models specific to Computer Vision
      Python
      BSD 3-Clause "New" or "Revised" License
      7k100 Updated Feb 7, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17336143 Updated Feb 7, 2025
    • rocminfo

      Public
      ROCm Application for Reporting System Info
      C++
      Other
      3237011 Updated Feb 7, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      61509 Updated Feb 7, 2025
    • aiter

      Public
      AI Tensor Engine for ROCm
      Cuda
      MIT License
      41855 Updated Feb 7, 2025
    • Python
      Other
      81683 Updated Feb 7, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.5k63529 Updated Feb 7, 2025
    • rocHPL

      Public
      High Performance Linpack for Next-Generation AMD HPC Accelerators
      C++
      Other
      204553 Updated Feb 7, 2025