DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications
Amazon
Pages: 288, Paperback, Independently published
Price: £ 26.57
Description
Amazon
Pages: 288, Paperback, Independently published