DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Independently Published

Prices from £ 38.01

Description

Amazon: Pages: 288, Hardcover, Independently published

Compare webshops (2)

Shop	Price
	£ 38.01	To Shop
	£ 38.01	To Shop

Brand	Independently Published
EAN	9798274508001

Prices were last updated on: 03-06-2026, 22:40

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

£ 21.68

Compare 2 stores

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

£ 12.55

Compare 2 stores

Independently Published

LLM Engineer: Build, Fine-Tune, and Deploy Production-Grade AI Applications with Python Modern LLMs

£ 25.92

Compare 2 stores

Independently Published

LLM Engineer: Build, Fine-Tune, and Deploy Production-Grade AI Applications with Python Modern LLMs

£ 18.51

Compare 2 stores
