Commit Graph

162 Commits

Author SHA1 Message Date
sixgod 471943bfd7 support INT4 inference 2024-11-01 10:21:56 +00:00
sixgod 24c8e1b317 transformers4.46 and vllm0.6.3 2024-11-01 09:24:39 +00:00
sixgod 2ba0aa4b54 transformers4.46 and vllm0.6.3 2024-11-01 09:23:05 +00:00
sixgod 02831d280b transformers4.46 and vllm0.6.3 2024-11-01 09:20:57 +00:00
sixgod a8069aca73 transformers4.46 and vllm0.6.3 2024-11-01 09:14:20 +00:00
sixgod 1d68300ecf transformers4.46 and vllm0.6.3 2024-11-01 09:11:59 +00:00
sixgod 9b39ba6d1b transformers4.46 and vllm0.6.3 2024-11-01 09:06:04 +00:00
sixgod e3e6de52c4 transformers4.46 and vllm0.6.3 2024-11-01 09:00:39 +00:00
zR 6bf9f85f70 remove wrong path 2024-10-29 01:41:10 +08:00
zR 94776fb841 update for transformers 4.46 2024-10-29 01:40:11 +08:00
zR c2c28bc45c transforemrs>=4.46 support 2024-10-29 00:13:41 +08:00
Zhengxiao Du 9cd635a825
Update README.md 2024-10-28 14:05:15 +08:00
Yuxuan.Zhang 4e9b473d18
Merge pull request from huolongguo1O/patch-1
解决finetune.py在加载配置文件时的错误
2024-10-16 17:36:19 +08:00
Yuxuan.Zhang 5142bdb6e1
Merge pull request from sixsixcoder/main
Add GLM-4v-9B model support for vllm framework
2024-10-15 14:55:35 +08:00
sixgod f25bba83b7
Rename llm_cli_vision_demo.py to vllm_cli_vision_demo.py 2024-10-12 22:31:04 +08:00
sixgod df567d31be
Create llm_cli_vision_demo.py 2024-10-12 22:30:37 +08:00
sixgod 68c6b52f61
Update README.md 2024-10-12 21:32:23 +08:00
sixgod 9ea3707c9e
Update README_en.md 2024-10-12 21:30:49 +08:00
sixgod 034bf5c448
Update README_en.md: Add GLM-4v-9B model support for vllm framework 2024-10-12 21:30:13 +08:00
zR 3e7735d4f7 update the req and chatglm_tokenizer.py 2024-10-06 14:09:05 +08:00
Yuxuan.Zhang 23773d94e2
Merge pull request from zhipuch/main
update readme
2024-09-26 13:52:20 +08:00
zhipuch 10f23a0cd3 update readme 2024-09-24 11:17:59 +00:00
zhipuch 0ef093c1a2 update readme 2024-09-24 11:15:20 +00:00
huolongguo1O f6ee34eb6e
Update finetune.py
上一个commit在配置文件添加了freezeV选项,但在finetune.py中没有支持。1c2676415c (diff-a70f5de59d3f0aced07176ede208e2d2f420d7eafd2d9cf2478f93636160e538)
暂时添加一句占位吧,虽然这个选项对纯文本微调没啥用
2024-09-17 09:50:19 +08:00
Yuxuan.Zhang 81af3cfc5a
Merge pull request from sixsixcoder/main
update that multi-GPUs inference with transformers in glm-4 and glm-4v
2024-09-11 19:52:17 +08:00
sixsixcoder 188c7956a1 transformers with glm4v lora adapter 2024-09-11 10:34:24 +00:00
Yushi Bai 281398e021
Update README.md 2024-09-10 23:55:33 +08:00
sixsixcoder 4995f03934 update that multi-GPUs inference with transformers in glm-4 and glm-4v 2024-09-10 07:16:42 +00:00
Yuxuan.Zhang 7d62623461
Merge pull request from sixsixcoder/main
Add support for OpenAI API server on the GLM-4v-9B model
2024-09-10 14:46:50 +08:00
Yuxuan.Zhang 5fe70a075b
Merge pull request from zhipuch/froze
Froze vision layers
2024-09-07 01:00:29 +08:00
zhipuch 1c2676415c freeze vision for sft 2024-09-06 09:31:27 +00:00
sixgod 88472f3ac2
Merge branch 'main' into main 2024-09-06 15:27:52 +08:00
sixsixcoder 851cc51b56 update readme 2024-09-06 07:25:16 +00:00
zR e058a509da fix readme 2024-09-06 15:15:04 +08:00
zR 5aca9bcb95 update readme 2024-09-06 15:14:25 +08:00
sixsixcoder a8c6d91b97 update readme 2024-09-06 06:55:24 +00:00
zhipuch 12cbfb6be0 update data 2024-09-06 06:21:12 +00:00
sixsixcoder 728b670b70 update readme 2024-09-06 06:04:23 +00:00
sixsixcoder 9f98825a63 add glm4v openai server 2024-09-06 05:59:41 +00:00
zhipuch 06bf1bb754 add data for finetune 2024-09-05 02:20:20 +00:00
zR 0f6a7c94fe Update .gitignore 2024-09-05 09:54:14 +08:00
zR 19f2f91fb5
Merge pull request from sixsixcoder/main
vLLM with LoRA adapter
2024-09-05 09:52:59 +08:00
sixgod cb038cd2d3
Update README_en.md 2024-09-04 21:20:45 +08:00
sixgod 7422d118e8
Update README.md 2024-09-04 21:19:37 +08:00
sixsixcoder af2fc45585 lora adapter with vllm 2024-09-04 10:30:21 +00:00
sixsixcoder d4a3b7ddba lora adapter with vllm 2024-09-04 10:28:22 +00:00
zR 1d78cfb3d7
Merge pull request from zhipuch/ftv
Support plain text fine-tuning
2024-09-04 18:11:15 +08:00
sixsixcoder fafa33d351 lora adapter with vllm 2024-09-04 09:10:03 +00:00
zhipuch cbe73627ff Support plain text fine-tuning 2024-09-03 09:52:02 +00:00
zR f0d67ff4a4 Update README.md 2024-08-17 11:02:40 +08:00