DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Independently Published
DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Image of DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Prices from

26.57

Featured

	£ 26.57	To Shop
	£ 26.57	To Shop

Description

Amazon Pages: 288, Paperback, Independently published

Compare webshops (2)

Shop

Price

£ 26.57

To Shop

£ 26.57

To Shop

Description (1)

Pages: 288, Paperback, Independently published

Brand	Independently Published
EAN	9798274507356

Prices were last updated on: 03-06-2026, 02:04

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

£ 21.68

Compare 2 stores 2 stores

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

£ 12.55

Compare 2 stores 2 stores

Independently Published

LLM Engineer: Build, Fine-Tune, and Deploy Production-Grade AI Applications with Python Modern LLMs

£ 25.92

Compare 2 stores 2 stores

Independently Published

LLM Engineer: Build, Fine-Tune, and Deploy Production-Grade AI Applications with Python Modern LLMs

£ 18.51

Compare 2 stores 2 stores

Featured Choice

£ 26.57

To Shop

DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Description

Product specifications