47 Commits

Author SHA1 Message Date
zR 5aca9bcb95 update readme 2024-09-06 15:14:25 +08:00
sixgod cb038cd2d3 Update README_en.md 2024-09-04 21:20:45 +08:00
sixgod 7422d118e8 Update README.md 2024-09-04 21:19:37 +08:00
sixsixcoder af2fc45585 lora adapter with vllm 2024-09-04 10:30:21 +00:00
sixsixcoder d4a3b7ddba lora adapter with vllm 2024-09-04 10:28:22 +00:00
sixsixcoder fafa33d351 lora adapter with vllm 2024-09-04 09:10:03 +00:00
peilongchencc 8dcd196955 Remove the unused parse_text module from trans_web_demo 2024-08-15 18:54:56 +08:00
zR 9a27e77bba requirement update 2024-08-11 17:59:00 +08:00
zR 913cb6dc06 Update some dependencies 2024-07-24 10:17:07 +08:00
ghohoj f887394c02 fix: fix flow output when having tools 2024-07-22 10:17:09 +08:00
zR 4ab7a1efd1 Update dependencies 2024-07-16 17:08:50 +08:00
zR 3dec01c6d5 dtype use bf16 2024-07-02 01:00:12 +08:00
jiawei 6c6d4637fb Update README.md 2024-06-25 16:12:59 +08:00
zR e5b5630498 fix #232 2024-06-24 23:45:04 +08:00
zR 5722878e25 update readme 2024-06-20 21:00:46 +08:00
zR 1de40b055f update report 2024-06-20 11:33:05 +08:00
zR b475ebe8ae update finetune demo 2024-06-19 11:22:31 +08:00
zR bab384d193 Merge pull request #155 from qq332982511/add_api_backend: composite demo add openai backend 2024-06-19 01:14:14 +08:00
zR ab011a344e add requirement of finetune 2024-06-19 01:12:20 +08:00
L eb8aeb37c1 Fix bugs in non-streaming output and in the tools parameter of process_response 2024-06-18 21:02:29 +08:00
zR ef015f88f9 Follow up on the code explanations for the OpenAI server part; with liuzhenghua 2024-06-18 18:13:12 +08:00
zR 5c4bf6201c fix openai stream function 2024-06-15 20:59:23 +08:00
黄乐乐 35ba249d28 Add GLM-4V-9B vision model gradio webui 2024-06-13 16:57:07 +08:00
yybear 293ca3b834 composite demo add openai backend 2024-06-12 23:59:22 +08:00
zR adeeb0e8e0 Merge branch 'main' of https://github.com/THUDM/GLM-4 2024-06-09 16:11:23 +08:00
zR a9fe1aba02 add openai demo stream and function call; fix #130 #124 2024-06-09 16:11:20 +08:00
zR 56426f2186 Merge pull request #91 from ztxtech/main: Add a block for setting the prompt 2024-06-08 20:18:57 +08:00
Tianxiang Zhan 8de4c76203 Update trans_web_demo.py; skip one round when a prompt is set 2024-06-08 16:39:07 +08:00
Tianxiang Zhan 76fff757b6 Fix role 2024-06-08 16:12:58 +08:00
zR 20a9f26ec6 Update some of the documentation 2024-06-08 13:26:43 +08:00
zR abe93e093d openai demo update; #64 #36 2024-06-07 22:14:00 +08:00
zR 7fcaeba6cc finetune and vision demo update 2024-06-07 16:53:56 +08:00
Tianxiang Zhan f4fc0a316e Add a block for setting the prompt 2024-06-06 19:57:58 +08:00
zR 9c2df689ac Merge pull request #70 from T-Atlas/patch-1: Fix openai_api_server request_id issue 2024-06-06 17:35:09 +08:00
zR ce2667cf5d fix issue #74 2024-06-06 16:18:14 +08:00
zR 8102212b9f fix with vllm requirements.txt 2024-06-06 13:57:22 +08:00
Lian Junhong 0b979f8bdb Fix openai_api_server request_id issue; add request_id=f"{time.time()}" to Fix API concurrency request issue 2024-06-06 12:37:26 +08:00
zR 1683d673d2 fix #60 and #51 2024-06-06 10:14:15 +08:00
zR 3f26ccc208 fix readme error 2024-06-06 10:00:11 +08:00
zR a263b69376 Merge pull request #28 from xusenlinzy/main: fix process response function 2024-06-05 18:07:07 +08:00
zR 0423d7ca6d update link 2024-06-05 16:24:36 +08:00
xusenlin 11e244c0f2 fix process response function 2024-06-05 16:11:54 +08:00
zR 29480b7394 add vision demo 2024-06-05 13:21:23 +08:00
zR 12dd318013 add glm-4v-9b stress test 2024-06-05 12:55:41 +08:00
Lao 205b7db3cc Update openai_api_server.py; Changed vllm’s gpu memory utilization 2024-06-05 11:48:20 +08:00
zR d95f131b03 Update the parameters of the VLLM demo 2024-06-05 11:43:08 +08:00
duzx16 c0a6d1e0fa init commit 2024-06-05 10:22:16 +08:00