Pull requests: triton-inference-server/server
Pull requests list
feat: Configurable grpc infer thread count
#8061
opened Mar 10, 2025 by
yinggeh
1 of 11 tasks
feat: Add OpenAI frontend multi-LoRA model listing
PR: feat
A new feature
#8052
opened Mar 4, 2025 by
kthui
9 of 20 tasks
Build: Build using the PA binaries and whl if available.
#8043
opened Feb 27, 2025 by
pvijayakrish
8 of 20 tasks
test: Add OpenAI frontend testing for LLM API backend
PR: test
Adding missing tests or correcting existing test
feat: Add multi-LoRA support to OpenAI frontend
PR: feat
A new feature
#8038
opened Feb 26, 2025 by
kthui
9 of 20 tasks
build: Removed workaround to install libboost-dev. Back to apt-get install
build
Issues pertaining to builds
#8037
opened Feb 26, 2025 by
dmitry-tokarev-nv
4 of 20 tasks
Adding multiple tokenizers specification for open ai frontend
#8027
opened Feb 21, 2025 by
oandreeva-nv
Draft
22 tasks
ci: Fix L0_batch related flaky tests
PR: ci
Changes to our CI configuration files and scripts
#7999
opened Feb 10, 2025 by
yinggeh
6 of 11 tasks
feat: Add graceful shutdown timer to GRPC frontend
enhancement
New feature or request
grpc
Related to the GRPC server
#7969
opened Jan 27, 2025 by
mattwittwer
8 of 20 tasks
Separate model generation for backends on blackwell clusters
#7966
opened Jan 24, 2025 by
pvijayakrish
3 of 20 tasks
docs: update to fix autoscaling example command
#7883
opened Dec 16, 2024 by
mattwittwer
Draft
20 tasks
feat: ORCA Format KV Cache Utilization in Inference Response Header
#7839
opened Nov 27, 2024 by
BenjaminBraunDev
12 of 22 tasks
refactor: Refactor of L0_backend_python and the env subtest
PR: ci
Changes to our CI configuration files and scripts
PR: refactor
A code change that neither fixes a bug nor adds a feature
#7838
opened Nov 27, 2024 by
nv-kmcgill53
Draft
5 of 20 tasks
ci: Enables testing for pull requests
#7828
opened Nov 23, 2024 by
pranavm-nvidia
3 of 20 tasks
test: Updates L0 Python API tests to run all test files
#7827
opened Nov 23, 2024 by
pranavm-nvidia
4 of 20 tasks
fix: Default max tokens to None for OpenAI frontend.
#7819
opened Nov 20, 2024 by
thealmightygrant
4 of 22 tasks
feat: Adding RestrictedFeatures Support to the Python Frontend Bindings
#7775
opened Nov 8, 2024 by
KrishnanPrash
docs: Add clarification for label_filename in classification docs
#7766
opened Nov 5, 2024 by
trevoryao
7 of 22 tasks
docs: Simplify PR templates
PR: docs
Documentation only changes
#7753
opened Oct 29, 2024 by
yinggeh
6 of 11 tasks
[Do not merge!] Build: Remove TRT model generation for V100
#7712
opened Oct 16, 2024 by
pvijayakrish
Draft
3 of 20 tasks