zR
|
a9fe1aba02
|
add openai demo stream and function call
fix #130 #124
|
2024-06-09 16:11:20 +08:00 |
zR
|
20a9f26ec6
|
更新部分说明
|
2024-06-08 13:26:43 +08:00 |
zR
|
abe93e093d
|
openai demo update
#64 #36
|
2024-06-07 22:14:00 +08:00 |
Lian Junhong
|
0b979f8bdb
|
Fix openai_api_server request_id issue
add request_id=f"{time.time()}" to Fix API concurrency request issue
|
2024-06-06 12:37:26 +08:00 |
zR
|
1683d673d2
|
fix #60 and #51
|
2024-06-06 10:14:15 +08:00 |
zR
|
3f26ccc208
|
fix readme error
|
2024-06-06 10:00:11 +08:00 |
xusenlin
|
11e244c0f2
|
fix process response function
|
2024-06-05 16:11:54 +08:00 |
Lao
|
205b7db3cc
|
Update openai_api_server.py
Changed vllm’s gpu memory utilization
|
2024-06-05 11:48:20 +08:00 |
duzx16
|
c0a6d1e0fa
|
init commit
|
2024-06-05 10:22:16 +08:00 |