在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称(OpenSource Name):AliyunContainerService/gpushare-scheduler-extender开源软件地址(OpenSource Url):https://github.com/AliyunContainerService/gpushare-scheduler-extender开源编程语言(OpenSource Language):Go 78.2%开源软件介绍(OpenSource Introduction):GPU Sharing Scheduler Extender in KubernetesOverviewMore and more data scientists run their Nvidia GPU based inference tasks on Kubernetes. Some of these tasks can be run on the same Nvidia GPU device to increase GPU utilization. So one important challenge is how to share GPUs between the pods. The community is also very interested in this topic. Now there is a GPU sharing solution on native Kubernetes: it is based on scheduler extenders and device plugin mechanism, so you can reuse this solution easily in your own Kubernetes. Prerequisites
DesignFor more details about the design of this project, please read this Design document. SetupYou can follow this Installation Guide. If you are using Alibaba Cloud Kubernetes, please follow this doc to install with Helm Charts. User GuideYou can check this User Guide. DevelopingScheduler Extendergit clone https://github.com/AliyunContainerService/gpushare-scheduler-extender.git && cd gpushare-scheduler-extender
docker build -t cheyang/gpushare-scheduler-extender . Device Plugingit clone https://github.com/AliyunContainerService/gpushare-device-plugin.git && cd gpushare-device-plugin
docker build -t cheyang/gpushare-device-plugin . Kubectl Extension
mkdir -p $GOPATH/src/github.com/AliyunContainerService
cd $GOPATH/src/github.com/AliyunContainerService
git clone https://github.com/AliyunContainerService/gpushare-device-plugin.git
cd gpushare-device-plugin
go build -o $GOPATH/bin/kubectl-inspect-gpushare-v2 cmd/inspect/*.go Demo- Demo 1: Deploy multiple GPU Shared Pods and schedule them on the same GPU device in binpack way- Demo 2: Avoid GPU memory requests that fit at the node level, but not at the GPU device levelRelated ProjectRoadmap
AdoptersIf you are intrested in GPUShare and would like to share your experiences with others, you are warmly welcome to add your information on ADOPTERS.md page. We will continuousely discuss new requirements and feature design with you in advance. Acknowledgments
|
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论