Commit Graph

4 Commits

Author SHA1 Message Date
zR 12dd318013 add glm-4v-9b stress test 2024-06-05 12:55:41 +08:00
Lao 205b7db3cc
Update openai_api_server.py
Changed vllm’s gpu memory utilization
2024-06-05 11:48:20 +08:00
zR d95f131b03 更新VLLM demo的参数 2024-06-05 11:43:08 +08:00
duzx16 c0a6d1e0fa init commit 2024-06-05 10:22:16 +08:00