Krrish Dholakia
|
2a033fd8a2
|
test(test_router_cooldowns.py): adding logging
|
2023-11-24 12:30:08 -08:00 |
|
Krrish Dholakia
|
3dcbf6197a
|
test(test_router.py): reintegrate router testing
|
2023-11-24 11:48:24 -08:00 |
|
Krrish Dholakia
|
c75e90663c
|
test(test_stream_chunk_builder.py): fix setting api key
|
2023-11-24 11:47:48 -08:00 |
|
Krrish Dholakia
|
2e8d582a34
|
fix(proxy_server.py): fix linting issues
|
2023-11-24 11:39:01 -08:00 |
|
Krrish Dholakia
|
4a5dae3941
|
fix(main.py): fix streaming_chunk_builder to return usage
|
2023-11-24 11:27:04 -08:00 |
|
ishaan-jaff
|
19fb24cd15
|
(feat) cost tracking for azure llms
|
2023-11-23 21:41:38 -08:00 |
|
Krrish Dholakia
|
9a44433844
|
feat(proxy_server.py): /key/delete endpoint
|
2023-11-23 21:37:53 -08:00 |
|
ishaan-jaff
|
0c210cc96c
|
(test) caching
|
2023-11-23 21:23:41 -08:00 |
|
ishaan-jaff
|
c929c274f6
|
(test) proxy: test_embedding
|
2023-11-23 21:16:51 -08:00 |
|
ishaan-jaff
|
ba9dbbe6d5
|
(test) router: acompletion + caching
|
2023-11-23 21:05:48 -08:00 |
|
ishaan-jaff
|
39072bd196
|
(test) proxy - azure/chat/completion test
|
2023-11-23 20:56:41 -08:00 |
|
Krrish Dholakia
|
187403c5cc
|
fix(router.py): add modelgroup to call metadata
|
2023-11-23 20:55:49 -08:00 |
|
ishaan-jaff
|
db8ed601b5
|
(test) temp comment router to test ci/cd
|
2023-11-23 20:28:43 -08:00 |
|
ishaan-jaff
|
8a72487dcf
|
(test) proxy_server /chat/completions
|
2023-11-23 20:28:06 -08:00 |
|
Krrish Dholakia
|
7d221fe863
|
fix(utils.py): make failure logging sync
|
2023-11-23 20:19:27 -08:00 |
|
ishaan-jaff
|
4816232674
|
(test) cost tracker
|
2023-11-23 18:28:37 -08:00 |
|
ishaan-jaff
|
b6f8dbbc0e
|
(test) caching
|
2023-11-23 18:17:12 -08:00 |
|
Krrish Dholakia
|
94dc3f66f3
|
fix(utils.py): remove eos token for zephyr models
|
2023-11-23 17:47:39 -08:00 |
|
Krrish Dholakia
|
f24786095a
|
fix(vertex_ai.py): fix exception mapping for vertex ai
|
2023-11-23 17:35:33 -08:00 |
|
ishaan-jaff
|
704af2ca34
|
(test) testing fix
|
2023-11-23 17:20:22 -08:00 |
|
ishaan-jaff
|
3118101e2c
|
(test) router: raises exception
|
2023-11-23 17:09:36 -08:00 |
|
ishaan-jaff
|
3ee141e1e2
|
(test) cleanup
|
2023-11-23 17:09:36 -08:00 |
|
Krrish Dholakia
|
abe2a009f9
|
test(test_completion.py): fix azure read from env variable test
|
2023-11-23 16:58:07 -08:00 |
|
ishaan-jaff
|
60dab0d700
|
(test) router: azure pass key in model list
|
2023-11-23 16:51:52 -08:00 |
|
ishaan-jaff
|
f7a5b31b96
|
(test) azure test passing key as completion param
|
2023-11-23 16:51:52 -08:00 |
|
Krrish Dholakia
|
c273d6f0d6
|
fix(router.py): add support for context window fallbacks on router
|
2023-11-23 16:43:02 -08:00 |
|
ishaan-jaff
|
a1bb880872
|
(test) router raise exceptions
|
2023-11-23 16:28:19 -08:00 |
|
Krrish Dholakia
|
afac42e93a
|
fix(router.py): enable async completions with model fallbacks
|
2023-11-23 16:15:57 -08:00 |
|
ishaan-jaff
|
1365092c89
|
(test) router: ensure errors are raised
|
2023-11-23 16:08:59 -08:00 |
|
ishaan-jaff
|
525e01824a
|
(test) ensure exception raised when incorrect key passed
|
2023-11-23 16:08:59 -08:00 |
|
Krrish Dholakia
|
8ac03e492f
|
fix(router.py): enable fallbacks for sync completions
|
2023-11-23 16:06:46 -08:00 |
|
ishaan-jaff
|
bb00595429
|
(feat) cost: azure gpt + testing
|
2023-11-23 14:20:48 -08:00 |
|
ishaan-jaff
|
90a9116a17
|
(test) strict test ft:gpt-3.5 cost calc
|
2023-11-23 14:11:01 -08:00 |
|
ishaan-jaff
|
da7c07574f
|
(test) cost calc on azure
|
2023-11-23 13:50:09 -08:00 |
|
Krrish Dholakia
|
4f183dc6a0
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
ishaan-jaff
|
2c50ea94c8
|
(test) embedding: azurea - dd response_ms
|
2023-11-23 11:15:52 -08:00 |
|
Krrish Dholakia
|
61fc76a8c4
|
fix(router.py): fix caching for tracking cooldowns + usage
|
2023-11-23 11:13:32 -08:00 |
|
ishaan-jaff
|
fbd2ab4c71
|
(test) caching ensure we always test add/get caching redis
|
2023-11-23 08:27:44 -08:00 |
|
ishaan-jaff
|
db146bc40a
|
(test) router with fallback deployments
|
2023-11-22 20:52:56 -08:00 |
|
ishaan-jaff
|
8ebc1b974c
|
(chore) run ci/cd again
|
2023-11-22 20:34:14 -08:00 |
|
Krrish Dholakia
|
5d5ca9f7ef
|
fix(router.py): add support for cooldowns with redis
|
2023-11-22 19:54:22 -08:00 |
|
Krrish Dholakia
|
3e76d4b422
|
feat(router.py): add server cooldown logic
|
2023-11-22 15:59:48 -08:00 |
|
ishaan-jaff
|
c38782521c
|
(test)timeout error on openai embedding
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
40e88eec4b
|
(test)timeout errors
|
2023-11-22 14:25:55 -08:00 |
|
ishaan-jaff
|
3059f30672
|
(test) verify azure response have expected keys
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
0b4e10e068
|
test(test_embedding.py): fix the embedding test
|
2023-11-22 14:09:45 -08:00 |
|
ishaan-jaff
|
40dd38508f
|
(test) embedding stricter testing
|
2023-11-22 13:50:45 -08:00 |
|
Krrish Dholakia
|
b0801f61e6
|
test(test_caching.py): cleaning up tests
|
2023-11-22 13:43:48 -08:00 |
|
Krrish Dholakia
|
78582e158a
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
Krrish Dholakia
|
604ad41eac
|
fix(proxy_server.py): If master key is set, only master key can be used to generate new keys
|
2023-11-22 10:18:28 -08:00 |
|