This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-16 10:26:07 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
neuron
/
1_core
History
Satyajith Chilappagari
e0cbad4e30
[Neuron] Support quantization on neuron (
#18283
)
...
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
2025-05-27 22:10:33 +00:00
..
test_activation.py
…
test_block_table.py
[Misc] Replace os environ to monkeypatch in test suite (
#14516
)
2025-03-16 20:35:57 -07:00
test_cache.py
[Neuron][kernel] Fuse kv cache into a single tensor (
#15911
)
2025-04-03 09:51:32 -07:00
test_layernorm.py
…
test_logits_processor.py
…
test_neuron_model_runner.py
Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (
#16357
)
2025-05-07 00:07:30 -07:00
test_neuron_quant.py
[Neuron] Support quantization on neuron (
#18283
)
2025-05-27 22:10:33 +00:00
test_prefix_prefill.py
[Neuron][kernel] Fuse kv cache into a single tensor (
#15911
)
2025-04-03 09:51:32 -07:00
test_rotary_embedding.py
Make key optional for rotary embedding (
#17566
)
2025-05-07 00:11:46 -07:00