mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-10 07:15:01 +08:00
Signed-off-by: Didier Durand <durand.didier@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
42 lines
1.3 KiB
Markdown
42 lines
1.3 KiB
Markdown
# Anything LLM
|
|
|
|
[Anything LLM](https://github.com/Mintplex-Labs/anything-llm) is a full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting.
|
|
|
|
It allows you to deploy a large language model (LLM) server with vLLM as the backend, which exposes OpenAI-compatible endpoints.
|
|
|
|
## Prerequisites
|
|
|
|
- Setup vLLM environment
|
|
|
|
## Deploy
|
|
|
|
- Start the vLLM server with the supported chat completion model, e.g.
|
|
|
|
```bash
|
|
vllm serve Qwen/Qwen1.5-32B-Chat-AWQ --max-model-len 4096
|
|
```
|
|
|
|
- Download and install [Anything LLM desktop](https://anythingllm.com/desktop).
|
|
|
|
- On the bottom left of open settings, AI Providers --> LLM:
|
|
- LLM Provider: Generic OpenAI
|
|
- Base URL: http://{vllm server host}:{vllm server port}/v1
|
|
- Chat Model Name: `Qwen/Qwen1.5-32B-Chat-AWQ`
|
|
|
|

|
|
|
|
- Back to home page, New Workspace --> create `vllm` workspace, and start to chat:
|
|
|
|

|
|
|
|
- Click the upload button:
|
|
- upload the doc
|
|
- select the doc and move to the workspace
|
|
- save and embed
|
|
|
|

|
|
|
|
- Chat again:
|
|
|
|

|