Commit Graph

74 Commits

Author SHA1 Message Date
efort 4e5cf388b4 更新 basic_demo/glm_server.py 2025-02-10 16:11:46 +08:00
efort 78cf805670 更新 basic_demo/glm4v_server.py 2025-02-10 15:36:35 +08:00
efort 483f37d290 更新 basic_demo/glm_server.py 2025-02-10 15:36:11 +08:00
efort fec6a8e275 更新 basic_demo/openai_api_server.py 2025-02-10 15:29:49 +08:00
efort 7cf4f4f422 更新 basic_demo/openai_api_server.py 2025-02-10 15:05:51 +08:00
zhipuch 4dc2f76e68 update:trans_batch_demo 2024-12-31 16:07:08 +08:00
zR e14f187090 Merge branch 'main' of https://github.com/THUDM/GLM-4 2024-12-09 15:54:29 +08:00
sixgod 5c70856738
fix bug in glm-4v openai_server 2024-12-02 18:02:26 +08:00
zR 4c66fa48d1 Merge branch 'main' of https://github.com/THUDM/GLM-4 2024-11-18 11:51:56 +08:00
zR 804733811f change of readme and req for vllm new version 2024-11-18 10:10:43 +08:00
sixsixcoder 476a066830 support vllm 0.6.3 2024-11-12 16:45:17 +08:00
zR d71b8c2284 comment with trust_remote_code=True 2024-11-01 18:49:55 +08:00
sixgod 471943bfd7 support INT4 inference 2024-11-01 10:21:56 +00:00
sixgod 9b39ba6d1b transformers4.46 and vllm0.6.3 2024-11-01 09:06:04 +00:00
sixgod e3e6de52c4 transformers4.46 and vllm0.6.3 2024-11-01 09:00:39 +00:00
zR 6bf9f85f70 remove wrong path 2024-10-29 01:41:10 +08:00
zR 94776fb841 update for transformers 4.46 2024-10-29 01:40:11 +08:00
zR c2c28bc45c transforemrs>=4.46 support 2024-10-29 00:13:41 +08:00
sixgod f25bba83b7
Rename llm_cli_vision_demo.py to vllm_cli_vision_demo.py 2024-10-12 22:31:04 +08:00
sixgod df567d31be
Create llm_cli_vision_demo.py 2024-10-12 22:30:37 +08:00
zR 3e7735d4f7 update the req and chatglm_tokenizer.py 2024-10-06 14:09:05 +08:00
zhipuch 10f23a0cd3 update readme 2024-09-24 11:17:59 +00:00
zhipuch 0ef093c1a2 update readme 2024-09-24 11:15:20 +00:00
sixsixcoder 188c7956a1 transformers with glm4v lora adapter 2024-09-11 10:34:24 +00:00
sixgod 88472f3ac2
Merge branch 'main' into main 2024-09-06 15:27:52 +08:00
zR 5aca9bcb95 update readme 2024-09-06 15:14:25 +08:00
sixsixcoder a8c6d91b97 update readme 2024-09-06 06:55:24 +00:00
sixsixcoder 9f98825a63 add glm4v openai server 2024-09-06 05:59:41 +00:00
sixgod cb038cd2d3
Update README_en.md 2024-09-04 21:20:45 +08:00
sixgod 7422d118e8
Update README.md 2024-09-04 21:19:37 +08:00
sixsixcoder af2fc45585 lora adapter with vllm 2024-09-04 10:30:21 +00:00
sixsixcoder d4a3b7ddba lora adapter with vllm 2024-09-04 10:28:22 +00:00
sixsixcoder fafa33d351 lora adapter with vllm 2024-09-04 09:10:03 +00:00
peilongchencc 8dcd196955 去除trans_web_demo中无用的parse_text模块 2024-08-15 18:54:56 +08:00
zR 9a27e77bba requirement update 2024-08-11 17:59:00 +08:00
zR 913cb6dc06 部分依赖更新 2024-07-24 10:17:07 +08:00
ghohoj f887394c02 fix: fix flow output when having tools 2024-07-22 10:17:09 +08:00
zR 4ab7a1efd1 依赖更新 2024-07-16 17:08:50 +08:00
zR 3dec01c6d5 dtype use bf16 2024-07-02 01:00:12 +08:00
jiawei 6c6d4637fb
Update README.md 2024-06-25 16:12:59 +08:00
zR e5b5630498 fix #232 2024-06-24 23:45:04 +08:00
zR 5722878e25 update readme 2024-06-20 21:00:46 +08:00
zR 1de40b055f update report 2024-06-20 11:33:05 +08:00
zR b475ebe8ae update finetune demo 2024-06-19 11:22:31 +08:00
zR bab384d193
Merge pull request #155 from qq332982511/add_api_backend
composite demo add openai backend
2024-06-19 01:14:14 +08:00
zR ab011a344e add requirement of finetune 2024-06-19 01:12:20 +08:00
L eb8aeb37c1 修正非流式输出和process_response tools参数的bug 2024-06-18 21:02:29 +08:00
zR ef015f88f9 跟进OpenAI server部分代码解释
with liuzhenghua
2024-06-18 18:13:12 +08:00
zR 5c4bf6201c fix openai stream function 2024-06-15 20:59:23 +08:00
黄乐乐 35ba249d28 Add GLM-4V-9B vision model gradio webui 2024-06-13 16:57:07 +08:00