Satyajith Chilappagari
043e4c4955
Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (#16357)
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
Co-authored-by: Aaron Dou <yzdou@amazon.com>
Co-authored-by: Shashwat Srijan <sssrijan@amazon.com>
Co-authored-by: Chongming Ni <chongmni@amazon.com>
Co-authored-by: Amulya Ballakur <amulyaab@amazon.com>
Co-authored-by: Patrick Lange <patlange@amazon.com>
Co-authored-by: Elaine Zhao <elaineyz@amazon.com>
Co-authored-by: Lin Lin Pan <tailinpa@amazon.com>
Co-authored-by: Navyadhara Gogineni <navyadha@amazon.com>
Co-authored-by: Yishan McNabb <yishanm@amazon.com>
Co-authored-by: Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
2025-05-07 00:07:30 -07:00
..
2025-04-15 08:05:30 +00:00
2025-04-17 13:22:40 -07:00
2025-02-08 04:25:15 -08:00
2025-03-02 17:34:51 -08:00
2025-05-02 03:29:25 -07:00
2025-04-28 10:05:00 +00:00
2025-04-16 22:19:26 -07:00
2025-02-02 11:58:18 -08:00
2025-04-15 08:05:30 +00:00
2025-04-14 09:59:15 +00:00
2025-04-29 21:10:00 +00:00
2025-04-15 08:05:30 +00:00
2025-04-15 08:05:30 +00:00
2025-04-17 04:17:39 +00:00
2025-04-15 08:05:30 +00:00
2025-04-16 10:16:36 +00:00
2025-04-03 07:32:10 +00:00
2025-05-06 23:10:37 -07:00
2025-04-19 12:13:06 +00:00
2025-04-15 08:05:30 +00:00
2025-04-08 10:42:32 +00:00
2025-05-07 00:07:30 -07:00
2025-04-08 10:42:32 +00:00
2025-05-07 00:07:30 -07:00
2025-04-08 10:42:32 +00:00
2025-04-08 10:42:32 +00:00
2025-04-15 08:05:30 +00:00
2025-05-03 19:42:43 -07:00
2025-04-08 10:42:32 +00:00
2025-03-07 00:32:46 +08:00
2025-03-07 00:32:46 +08:00
2025-04-08 10:42:32 +00:00
2025-04-15 08:05:30 +00:00
2025-04-15 08:05:30 +00:00
2025-04-09 23:32:42 +00:00
2025-04-08 10:42:32 +00:00
2025-05-06 13:55:04 -04:00
2025-04-15 08:05:30 +00:00
2025-05-01 11:00:53 -07:00
2025-05-06 16:12:28 +00:00