Skip to content
View nickaggarwal's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report nickaggarwal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. kserve kserve Public

    Forked from kserve/kserve

    Standardized Serverless ML Inference Platform on Kubernetes

    Python

  2. nvidia-triton-llm-streaming nvidia-triton-llm-streaming Public

    Integrating SSE with NVIDIA Triton Inference Server using a Python backend and Zephyr model. There is very less documentation how to use Nvidia Triton in Streaming use-cases ( hard to find in their…

    Python 10

  3. DeepSeek-R1-Distill-Qwen-32B DeepSeek-R1-Distill-Qwen-32B Public template

    Forked from inferless/deepseek-r1-distill-qwen-32b

    DeepSeek-R1-Distill-Qwen-32B is a distilled variant within the DeepSeek-R1 series. The dataset used for training is meticulously curated from the DeepSeek-R1 model, with Qwen2.5-32B serving as the …

    Python

  4. inferless/triton-co-pilot inferless/triton-co-pilot Public

    Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments

    Python 19 3

  5. inferless/whisper-large-v3 inferless/whisper-large-v3 Public template

    State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    Python 15 12

  6. open-docs open-docs Public

    A documentation website built with React, TypeScript, Bootstrap, and MDX .

    MDX 3