Commit graph

3831 commits

Author SHA1 Message Date
Krrish Dholakia
d62da29cbe fix: fix linting issues 2023-11-24 15:46:25 -08:00
Krrish Dholakia
fe900966c1 docs(simple_proxy.md): adding tracking spend via proxy to docs 2023-11-24 15:41:52 -08:00
Krrish Dholakia
1a6ea20a0b fix(main.py): fixing linting issues 2023-11-24 15:25:51 -08:00
Krrish Dholakia
bc84b38154 feat(proxy_server.py): new /key/info endpoint to access key information (master key only) 2023-11-24 15:24:50 -08:00
Krrish Dholakia
4f22e7de18 feat(proxy_server.py): tracking spend per api key 2023-11-24 15:14:06 -08:00
Krrish Dholakia
2b52e6995c docs(perplexity.md): fix model list
https://github.com/BerriAI/litellm/issues/908
2023-11-24 14:14:47 -08:00
ishaan-jaff
41c6388185 (test) completion with many params 2023-11-24 14:01:21 -08:00
ishaan-jaff
a6bea946b3 (fix) completion: when logit bias is None 2023-11-24 14:01:21 -08:00
ishaan-jaff
0f873c756d (fix) completion: OpenAI/Azure filter out None params 2023-11-24 14:01:21 -08:00
Krrish Dholakia
f1dbdb58bb docs(simple_proxy.md): add cooldown to docs 2023-11-24 13:37:16 -08:00
Krrish Dholakia
2686894823 fix(router.py): fix retry logic 2023-11-24 13:27:44 -08:00
Krrish Dholakia
16e1070dbe test: refactor testing order 2023-11-24 12:47:28 -08:00
Krrish Dholakia
2a033fd8a2 test(test_router_cooldowns.py): adding logging 2023-11-24 12:30:08 -08:00
Krrish Dholakia
12dbdc4c15 docs(simple_proxy.md): add tutorial for doing fallbacks + retries + timeouts on the proxy 2023-11-24 12:20:38 -08:00
Krrish Dholakia
3dcbf6197a test(test_router.py): reintegrate router testing 2023-11-24 11:48:24 -08:00
Krrish Dholakia
c75e90663c test(test_stream_chunk_builder.py): fix setting api key 2023-11-24 11:47:48 -08:00
Krrish Dholakia
2e8d582a34 fix(proxy_server.py): fix linting issues 2023-11-24 11:39:01 -08:00
Krish Dholakia
d3d8b86eaa Updated config.yml 2023-11-24 11:34:24 -08:00
Krrish Dholakia
4a5dae3941 fix(main.py): fix streaming_chunk_builder to return usage 2023-11-24 11:27:04 -08:00
ishaan-jaff
8edfcd8e5d (fix) prisma using: secrets.compare_digest 2023-11-24 10:02:08 -08:00
Ishaan Jaff
86e405c9ab
Merge pull request #904 from Manouchehri/fix-auth-1
Fix OpenAPI auth spec.
2023-11-24 09:44:30 -08:00
Ishaan Jaff
2e140fe56a
Merge pull request #903 from kumaranvpl/main
Fix wrong model mentioned in simple proxy doc page
2023-11-24 09:39:06 -08:00
David Manouchehri
ac08e3616c
Fix timing attack on master_key. 2023-11-24 12:12:29 -05:00
David Manouchehri
5b6f227170
Fix master key check. 2023-11-24 12:03:30 -05:00
David Manouchehri
3fa3a767b3
Fix OpenAPI auth spec. 2023-11-24 11:59:33 -05:00
Kumaran Rajendhiran
d695e9dfdd Fix minor README issue 2023-11-24 19:53:51 +05:30
Kumaran Rajendhiran
79569e4f2c Fix wrong model mentioned in simple proxy doc page 2023-11-24 19:46:35 +05:30
ishaan-jaff
0a23af7990 (feat) proxy: cost tracking add track_cost_callback 2023-11-24 05:53:23 -08:00
Ishaan Jaff
b03a7ae0cd
Merge pull request #902 from kumaranvpl/main
Fail gracefully if ollama is already being served
2023-11-24 05:15:12 -08:00
Kumaran Rajendhiran
01fad94485 Fail gracefully if ollama is already being served 2023-11-24 16:52:55 +05:30
ishaan-jaff
824136667f (fix) add azure/ to model. TY Krrish ! 2023-11-23 21:44:08 -08:00
Krrish Dholakia
c9e37b8fb8 fix(proxy_server.py): fix linting errors 2023-11-23 21:42:39 -08:00
ishaan-jaff
19fb24cd15 (feat) cost tracking for azure llms 2023-11-23 21:41:38 -08:00
Krrish Dholakia
9a44433844 feat(proxy_server.py): /key/delete endpoint 2023-11-23 21:37:53 -08:00
ishaan-jaff
0c210cc96c (test) caching 2023-11-23 21:23:41 -08:00
ishaan-jaff
c929c274f6 (test) proxy: test_embedding 2023-11-23 21:16:51 -08:00
ishaan-jaff
486037ff8b (fix) proxy: /embeddings 2023-11-23 21:16:51 -08:00
Krrish Dholakia
5c18771f9d fix(router.py): fixing embedding call 2023-11-23 21:07:02 -08:00
ishaan-jaff
ba9dbbe6d5 (test) router: acompletion + caching 2023-11-23 21:05:48 -08:00
Krrish Dholakia
02464f6661 fix(router.py): use an older version of async for compatibility 2023-11-23 21:00:53 -08:00
ishaan-jaff
39072bd196 (test) proxy - azure/chat/completion test 2023-11-23 20:56:41 -08:00
ishaan-jaff
ca852e1dcd (fix) caching use model, messages, temp, max_tokens as cache_key 2023-11-23 20:56:41 -08:00
Krrish Dholakia
187403c5cc fix(router.py): add modelgroup to call metadata 2023-11-23 20:55:49 -08:00
ishaan-jaff
db8ed601b5 (test) temp comment router to test ci/cd 2023-11-23 20:28:43 -08:00
ishaan-jaff
8a72487dcf (test) proxy_server /chat/completions 2023-11-23 20:28:06 -08:00
Krrish Dholakia
7d221fe863 fix(utils.py): make failure logging sync 2023-11-23 20:19:27 -08:00
ishaan-jaff
31c5b4572b (fix) proxy: prisma.schema 2023-11-23 20:11:42 -08:00
ishaan-jaff
4816232674 (test) cost tracker 2023-11-23 18:28:37 -08:00
ishaan-jaff
695eaac542 (fix) cost calculator for FT: gpt-3.5 2023-11-23 18:28:21 -08:00
ishaan-jaff
b6f8dbbc0e (test) caching 2023-11-23 18:17:12 -08:00