Windows 安装 Xinference_xinference windows 部署

作者：你好赵伟 | 2024-04-05 02:39:12

踩

xinference windows 部署

Windows 安装 Xinference

0. 引言
1. 创建虚拟环境
2. 安装 pytorch
3. 安装 llama_cpp_python
4. 安装 chatglm-cpp
5. 安装 Xinference
6. 设置 model 路径
7. 启动 Xinference
8. 查看 Cluster Information

0. 引言

Xorbits Inference（Xinference）是一个性能强大且功能全面的分布式推理框架。可用于大语言模型（LLM），语音识别模型，多模态模型等各种模型的推理。通过 Xorbits Inference，你可以轻松地一键部署你自己的模型或内置的前沿开源模型。无论你是研究者，开发者，或是数据科学家，都可以通过 Xorbits Inference 与最前沿的 AI 模型，发掘更多可能。

为什么选择 Xinference？

在这里插入图片描述

启动后的画面，

在这里插入图片描述

1. 创建虚拟环境

conda create -n xinference python=3.10 -y
conda activate xinference 
1
2

2. 安装 pytorch

conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia
1

3. 安装 llama_cpp_python

pip install https://github.com/abetlen/llama-cpp-python/releases/download/v0.2.55/llama_cpp_python-0.2.55-cp310-cp310-win_amd64.whl
1

refer: https://github.com/abetlen/llama-cpp-python

4. 安装 chatglm-cpp

pip install https://github.com/li-plus/chatglm.cpp/releases/download/v0.3.1/chatglm_cpp-0.3.1-cp310-cp310-win_amd64.whl
1

refer: https://github.com/li-plus/chatglm.cpp

5. 安装 Xinference

pip install "xinference[all]"
1

refer: https://github.com/xorbitsai/inference

6. 设置 model 路径

在我的电脑上设置环境变量，路径请根据各自环境修改。

XINFERENCE_HOME=F:\XinferenceCache
1

7. 启动 Xinference

xinference-local -H <your_ip>
1

在这里插入图片描述
选择一个 Model 运行，

在这里插入图片描述
运行成功后，在 “Running Models” 页面可以查看，

在这里插入图片描述

8. 查看 Cluster Information

点击 Cluster Information，

在这里插入图片描述
完结！

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/你好赵伟/article/detail/362759