aws-navyadhara

Navyadhara Gogineni

vllm-project/vllm-neuron (Public)

Community maintained hardware plugin for vLLM on AWS Neuron

Python, 16 stars, 4 forks

vllm-project/vllm (Public)

A high-throughput and memory-efficient inference and serving engine for LLMs

Python, 66.3k stars, 12.2k forks

aws-neuron/upstreaming-to-vllm (Public)

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python, 25 stars, 13 forks