ishaan-jaff
|
0f873c756d
|
(fix) completion: OpenAI/Azure filter out None params
|
2023-11-24 14:01:21 -08:00 |
|
Krrish Dholakia
|
f1dbdb58bb
|
docs(simple_proxy.md): add cooldown to docs
|
2023-11-24 13:37:16 -08:00 |
|
Krrish Dholakia
|
2686894823
|
fix(router.py): fix retry logic
|
2023-11-24 13:27:44 -08:00 |
|
Krrish Dholakia
|
16e1070dbe
|
test: refactor testing order
|
2023-11-24 12:47:28 -08:00 |
|
Krrish Dholakia
|
2a033fd8a2
|
test(test_router_cooldowns.py): adding logging
|
2023-11-24 12:30:08 -08:00 |
|
Krrish Dholakia
|
12dbdc4c15
|
docs(simple_proxy.md): add tutorial for doing fallbacks + retries + timeouts on the proxy
|
2023-11-24 12:20:38 -08:00 |
|
Krrish Dholakia
|
3dcbf6197a
|
test(test_router.py): reintegrate router testing
|
2023-11-24 11:48:24 -08:00 |
|
Krrish Dholakia
|
c75e90663c
|
test(test_stream_chunk_builder.py): fix setting api key
|
2023-11-24 11:47:48 -08:00 |
|
Krrish Dholakia
|
2e8d582a34
|
fix(proxy_server.py): fix linting issues
|
2023-11-24 11:39:01 -08:00 |
|
Krish Dholakia
|
d3d8b86eaa
|
Updated config.yml
|
2023-11-24 11:34:24 -08:00 |
|
Krrish Dholakia
|
4a5dae3941
|
fix(main.py): fix streaming_chunk_builder to return usage
|
2023-11-24 11:27:04 -08:00 |
|
ishaan-jaff
|
8edfcd8e5d
|
(fix) prisma using: secrets.compare_digest
|
2023-11-24 10:02:08 -08:00 |
|
Ishaan Jaff
|
86e405c9ab
|
Merge pull request #904 from Manouchehri/fix-auth-1
Fix OpenAPI auth spec.
|
2023-11-24 09:44:30 -08:00 |
|
Ishaan Jaff
|
2e140fe56a
|
Merge pull request #903 from kumaranvpl/main
Fix wrong model mentioned in simple proxy doc page
|
2023-11-24 09:39:06 -08:00 |
|
David Manouchehri
|
ac08e3616c
|
Fix timing attack on master_key.
|
2023-11-24 12:12:29 -05:00 |
|
David Manouchehri
|
5b6f227170
|
Fix master key check.
|
2023-11-24 12:03:30 -05:00 |
|
David Manouchehri
|
3fa3a767b3
|
Fix OpenAPI auth spec.
|
2023-11-24 11:59:33 -05:00 |
|
Kumaran Rajendhiran
|
d695e9dfdd
|
Fix minor README issue
|
2023-11-24 19:53:51 +05:30 |
|
Kumaran Rajendhiran
|
79569e4f2c
|
Fix wrong model mentioned in simple proxy doc page
|
2023-11-24 19:46:35 +05:30 |
|
ishaan-jaff
|
0a23af7990
|
(feat) proxy: cost tracking add track_cost_callback
|
2023-11-24 05:53:23 -08:00 |
|
Ishaan Jaff
|
b03a7ae0cd
|
Merge pull request #902 from kumaranvpl/main
Fail gracefully if ollama is already being served
|
2023-11-24 05:15:12 -08:00 |
|
Kumaran Rajendhiran
|
01fad94485
|
Fail gracefully if ollama is already being served
|
2023-11-24 16:52:55 +05:30 |
|
ishaan-jaff
|
824136667f
|
(fix) add azure/ to model. TY Krrish !
|
2023-11-23 21:44:08 -08:00 |
|
Krrish Dholakia
|
c9e37b8fb8
|
fix(proxy_server.py): fix linting errors
|
2023-11-23 21:42:39 -08:00 |
|
ishaan-jaff
|
19fb24cd15
|
(feat) cost tracking for azure llms
|
2023-11-23 21:41:38 -08:00 |
|
Krrish Dholakia
|
9a44433844
|
feat(proxy_server.py): /key/delete endpoint
|
2023-11-23 21:37:53 -08:00 |
|
ishaan-jaff
|
0c210cc96c
|
(test) caching
|
2023-11-23 21:23:41 -08:00 |
|
ishaan-jaff
|
c929c274f6
|
(test) proxy: test_embedding
|
2023-11-23 21:16:51 -08:00 |
|
ishaan-jaff
|
486037ff8b
|
(fix) proxy: /embeddings
|
2023-11-23 21:16:51 -08:00 |
|
Krrish Dholakia
|
5c18771f9d
|
fix(router.py): fixing embedding call
|
2023-11-23 21:07:02 -08:00 |
|
ishaan-jaff
|
ba9dbbe6d5
|
(test) router: acompletion + caching
|
2023-11-23 21:05:48 -08:00 |
|
Krrish Dholakia
|
02464f6661
|
fix(router.py): use an older version of async for compatibility
|
2023-11-23 21:00:53 -08:00 |
|
ishaan-jaff
|
39072bd196
|
(test) proxy - azure/chat/completion test
|
2023-11-23 20:56:41 -08:00 |
|
ishaan-jaff
|
ca852e1dcd
|
(fix) caching use model, messages, temp, max_tokens as cache_key
|
2023-11-23 20:56:41 -08:00 |
|
Krrish Dholakia
|
187403c5cc
|
fix(router.py): add modelgroup to call metadata
|
2023-11-23 20:55:49 -08:00 |
|
ishaan-jaff
|
db8ed601b5
|
(test) temp comment router to test ci/cd
|
2023-11-23 20:28:43 -08:00 |
|
ishaan-jaff
|
8a72487dcf
|
(test) proxy_server /chat/completions
|
2023-11-23 20:28:06 -08:00 |
|
Krrish Dholakia
|
7d221fe863
|
fix(utils.py): make failure logging sync
|
2023-11-23 20:19:27 -08:00 |
|
ishaan-jaff
|
31c5b4572b
|
(fix) proxy: prisma.schema
|
2023-11-23 20:11:42 -08:00 |
|
ishaan-jaff
|
4816232674
|
(test) cost tracker
|
2023-11-23 18:28:37 -08:00 |
|
ishaan-jaff
|
695eaac542
|
(fix) cost calculator for FT: gpt-3.5
|
2023-11-23 18:28:21 -08:00 |
|
ishaan-jaff
|
b6f8dbbc0e
|
(test) caching
|
2023-11-23 18:17:12 -08:00 |
|
ishaan-jaff
|
3660fb1f7f
|
(feat) caching: Use seed, max_tokens etc in cache key
|
2023-11-23 18:17:12 -08:00 |
|
Krrish Dholakia
|
1cf85b4eb9
|
fix(utils.py): vertex ai api error exception mapping
|
2023-11-23 17:50:50 -08:00 |
|
Krrish Dholakia
|
94dc3f66f3
|
fix(utils.py): remove eos token for zephyr models
|
2023-11-23 17:47:39 -08:00 |
|
Krrish Dholakia
|
f24786095a
|
fix(vertex_ai.py): fix exception mapping for vertex ai
|
2023-11-23 17:35:33 -08:00 |
|
ishaan-jaff
|
704af2ca34
|
(test) testing fix
|
2023-11-23 17:20:22 -08:00 |
|
ishaan-jaff
|
69c6bbd50b
|
(chore) remove bloat: deprecated api.litellm cache
|
2023-11-23 17:20:22 -08:00 |
|
Krrish Dholakia
|
2cc6acdeec
|
docs(routing.md): add docs on fallbacks, caching, retries, timeouts for router
|
2023-11-23 17:17:18 -08:00 |
|
ishaan-jaff
|
3118101e2c
|
(test) router: raises exception
|
2023-11-23 17:09:36 -08:00 |
|