efort
|
78cf805670
|
Update basic_demo/glm4v_server.py
|
2025-02-10 15:36:35 +08:00 |
efort
|
483f37d290
|
Update basic_demo/glm_server.py
|
2025-02-10 15:36:11 +08:00 |
efort
|
fec6a8e275
|
Update basic_demo/openai_api_server.py
|
2025-02-10 15:29:49 +08:00 |
efort
|
7cf4f4f422
|
Update basic_demo/openai_api_server.py
|
2025-02-10 15:05:51 +08:00 |
zhipuch
|
4dc2f76e68
|
update:trans_batch_demo
|
2024-12-31 16:07:08 +08:00 |
zR
|
e14f187090
|
Merge branch 'main' of https://github.com/THUDM/GLM-4
|
2024-12-09 15:54:29 +08:00 |
sixgod
|
5c70856738
|
fix bug in glm-4v openai_server
|
2024-12-02 18:02:26 +08:00 |
zR
|
4c66fa48d1
|
Merge branch 'main' of https://github.com/THUDM/GLM-4
|
2024-11-18 11:51:56 +08:00 |
zR
|
804733811f
|
change of readme and req for vllm new version
|
2024-11-18 10:10:43 +08:00 |
sixsixcoder
|
476a066830
|
support vllm 0.6.3
|
2024-11-12 16:45:17 +08:00 |
zR
|
d71b8c2284
|
comment with trust_remote_code=True
|
2024-11-01 18:49:55 +08:00 |
sixgod
|
471943bfd7
|
support INT4 inference
|
2024-11-01 10:21:56 +00:00 |
sixgod
|
9b39ba6d1b
|
transformers4.46 and vllm0.6.3
|
2024-11-01 09:06:04 +00:00 |
sixgod
|
e3e6de52c4
|
transformers4.46 and vllm0.6.3
|
2024-11-01 09:00:39 +00:00 |
zR
|
6bf9f85f70
|
remove wrong path
|
2024-10-29 01:41:10 +08:00 |
zR
|
94776fb841
|
update for transformers 4.46
|
2024-10-29 01:40:11 +08:00 |
zR
|
c2c28bc45c
|
transformers>=4.46 support
|
2024-10-29 00:13:41 +08:00 |
sixgod
|
f25bba83b7
|
Rename llm_cli_vision_demo.py to vllm_cli_vision_demo.py
|
2024-10-12 22:31:04 +08:00 |
sixgod
|
df567d31be
|
Create llm_cli_vision_demo.py
|
2024-10-12 22:30:37 +08:00 |
zR
|
3e7735d4f7
|
update the req and chatglm_tokenizer.py
|
2024-10-06 14:09:05 +08:00 |
zhipuch
|
10f23a0cd3
|
update readme
|
2024-09-24 11:17:59 +00:00 |
zhipuch
|
0ef093c1a2
|
update readme
|
2024-09-24 11:15:20 +00:00 |
sixsixcoder
|
188c7956a1
|
transformers with glm4v lora adapter
|
2024-09-11 10:34:24 +00:00 |
sixgod
|
88472f3ac2
|
Merge branch 'main' into main
|
2024-09-06 15:27:52 +08:00 |
zR
|
5aca9bcb95
|
update readme
|
2024-09-06 15:14:25 +08:00 |
sixsixcoder
|
a8c6d91b97
|
update readme
|
2024-09-06 06:55:24 +00:00 |
sixsixcoder
|
9f98825a63
|
add glm4v openai server
|
2024-09-06 05:59:41 +00:00 |
sixgod
|
cb038cd2d3
|
Update README_en.md
|
2024-09-04 21:20:45 +08:00 |
sixgod
|
7422d118e8
|
Update README.md
|
2024-09-04 21:19:37 +08:00 |
sixsixcoder
|
af2fc45585
|
lora adapter with vllm
|
2024-09-04 10:30:21 +00:00 |
sixsixcoder
|
d4a3b7ddba
|
lora adapter with vllm
|
2024-09-04 10:28:22 +00:00 |
sixsixcoder
|
fafa33d351
|
lora adapter with vllm
|
2024-09-04 09:10:03 +00:00 |
peilongchencc
|
8dcd196955
|
Remove the unused parse_text module from trans_web_demo
|
2024-08-15 18:54:56 +08:00 |
zR
|
9a27e77bba
|
requirement update
|
2024-08-11 17:59:00 +08:00 |
zR
|
913cb6dc06
|
Update some dependencies
|
2024-07-24 10:17:07 +08:00 |
ghohoj
|
f887394c02
|
fix: fix flow output when having tools
|
2024-07-22 10:17:09 +08:00 |
zR
|
4ab7a1efd1
|
Update dependencies
|
2024-07-16 17:08:50 +08:00 |
zR
|
3dec01c6d5
|
dtype use bf16
|
2024-07-02 01:00:12 +08:00 |
jiawei
|
6c6d4637fb
|
Update README.md
|
2024-06-25 16:12:59 +08:00 |
zR
|
e5b5630498
|
fix #232
|
2024-06-24 23:45:04 +08:00 |
zR
|
5722878e25
|
update readme
|
2024-06-20 21:00:46 +08:00 |
zR
|
1de40b055f
|
update report
|
2024-06-20 11:33:05 +08:00 |
zR
|
b475ebe8ae
|
update finetune demo
|
2024-06-19 11:22:31 +08:00 |
zR
|
bab384d193
|
Merge pull request #155 from qq332982511/add_api_backend
composite demo add openai backend
|
2024-06-19 01:14:14 +08:00 |
zR
|
ab011a344e
|
add requirement of finetune
|
2024-06-19 01:12:20 +08:00 |
L
|
eb8aeb37c1
|
Fix bugs in non-streaming output and in the tools parameter of process_response
|
2024-06-18 21:02:29 +08:00 |
zR
|
ef015f88f9
|
Follow up on code explanations for the OpenAI server part
with liuzhenghua
|
2024-06-18 18:13:12 +08:00 |
zR
|
5c4bf6201c
|
fix openai stream function
|
2024-06-15 20:59:23 +08:00 |
黄乐乐
|
35ba249d28
|
Add GLM-4V-9B vision model gradio webui
|
2024-06-13 16:57:07 +08:00 |
yybear
|
293ca3b834
|
composite demo add openai backend
|
2024-06-12 23:59:22 +08:00 |