    Repositories list

    • FastVideo

      Public
      FastVideo is a lightweight framework for accelerating large video diffusion models.
      Python
      Apache License 2.0
      711.2k286 Updated Mar 7, 2025
    • Website for CSE 234, Winter 2025
      SCSS
      Other
      46706 Updated Mar 7, 2025
    • [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
      Python
      Apache License 2.0
      731.2k331 Updated Mar 6, 2025
    • Python
      352300 Updated Mar 1, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      6.1k001 Updated Feb 28, 2025
    • HTML
      8110 Updated Feb 22, 2025
    • Dynasor

      Public
      A simple extension on vLLM that helps you speed up reasoning models without training.
      Python
      MIT License
      1711960 Updated Feb 21, 2025
    • [ICML 2024] CLLMs: Consistency Large Language Models
      Python
      Apache License 2.0
      1838170 Updated Nov 16, 2024
    • vllm-ltr

      Public
      [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
      Python
      Apache License 2.0
      84100 Updated Nov 4, 2024
    • MuxServe

      Public
      Jupyter Notebook
      45220 Updated Jun 13, 2024
    • dsc291-PA

      Public
      Jupyter Notebook
      3200 Updated Jun 6, 2024
    • Website for DSC 291, Spring 2024
      SCSS
      Other
      46000 Updated Jun 5, 2024
    • Website for DSC 204a, Winter 2024
      SCSS
      Other
      46801 Updated Mar 24, 2024