Satyajith Chilappagari
043e4c4955
Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (#16357)
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
Co-authored-by: Aaron Dou <yzdou@amazon.com>
Co-authored-by: Shashwat Srijan <sssrijan@amazon.com>
Co-authored-by: Chongming Ni <chongmni@amazon.com>
Co-authored-by: Amulya Ballakur <amulyaab@amazon.com>
Co-authored-by: Patrick Lange <patlange@amazon.com>
Co-authored-by: Elaine Zhao <elaineyz@amazon.com>
Co-authored-by: Lin Lin Pan <tailinpa@amazon.com>
Co-authored-by: Navyadhara Gogineni <navyadha@amazon.com>
Co-authored-by: Yishan McNabb <yishanm@amazon.com>
Co-authored-by: Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
2025-05-07 00:07:30 -07:00