Intel software engineers continue to be hard at work on LLM-Scaler, their solution for running vLLM on Intel GPUs in a Docker containerized environment. A new beta release of LLM-Scaler built around vLLM arrived overnight with support for running more large language models.
Since the project's "LLM-Scaler 1.0" debut back in August, there have been frequent updates expanding LLM coverage on Intel GPUs and exposing more features for harnessing the AI compute power of Intel graphics hardware. The versioning scheme, though, remains a mess: today's release is tagged "llm-scaler-vllm beta release 0.10.2-b6" even though "1.0" was previously announced.

