Commit graph

3189 commits

Author SHA1 Message Date
Krrish Dholakia
49932ac90a test: skip flaky tests 2023-12-23 12:37:38 +05:30
Krrish Dholakia
e620d2f219 fix(utils.py): log user_id to langfuse 2023-12-23 12:14:09 +05:30
Krish Dholakia
2df5ce4b7c
Merge pull request #1182 from sumanth13131/usage-based-routing-fix
usage_based_routing_fix
2023-12-23 11:50:34 +05:30
Krish Dholakia
d195787db7
Merge pull request #1183 from maxdeichmann/improve-langchain-integration
Improve langfuse integration
2023-12-23 11:47:36 +05:30
Krish Dholakia
710c809478
Merge pull request #1195 from AllentDan/fix-routing
fix least_busy router by updating min_traffic
2023-12-23 11:45:35 +05:30
Krish Dholakia
03fd5da5ae
Merge pull request #1203 from Manouchehri/bedrock-cloudflare-ai-gateway-1
Add aws_bedrock_runtime_endpoint support
2023-12-23 11:44:04 +05:30
Krish Dholakia
8afdc12918
Merge pull request #1211 from sihyeonn/fix/sh-success-callback
fix: success_callback logic for cost_tracking
2023-12-23 11:41:30 +05:30
Krish Dholakia
81617534b6
Merge pull request #1213 from neubig/vertex_chat_generate_content
Make vertex ai work with generate_content
2023-12-23 11:40:43 +05:30
Krrish Dholakia
79a79b16e1 bump: version 1.15.6 → 1.15.7 2023-12-23 10:03:49 +05:30
Krrish Dholakia
43f4096014 fix(langsmith.py): fix langsmith streaming logging 2023-12-23 10:02:35 +05:30
Krrish Dholakia
89ee9fe400 fix(proxy_server.py): manage budget at user-level not key-level
https://github.com/BerriAI/litellm/issues/1220
2023-12-22 15:10:38 +05:30
Krrish Dholakia
979575a2a6 fix(proxy_server.py): handle misformatted json body in chat completion request 2023-12-22 12:30:36 +05:30
Krrish Dholakia
1e526c7e06 bump: version 1.15.5 → 1.15.6 2023-12-22 12:24:20 +05:30
Krrish Dholakia
f1270a7c78 fix(utils.py): handle ollama yielding a dict 2023-12-22 12:23:42 +05:30
Krrish Dholakia
eb2d13e2fb test(test_completion.py-+-test_streaming.py): add ollama endpoint to ci/cd pipeline 2023-12-22 12:21:33 +05:30
Krrish Dholakia
57607f111a fix(ollama.py): use litellm.request timeout for async call timeout 2023-12-22 11:22:24 +05:30
Krrish Dholakia
278f61f3ed fix(utils.py): handle 'os.environ/' being passed in as kwargs 2023-12-22 11:08:44 +05:30
Max Deichmann
1c68f5557d
Merge branch 'main' into improve-langchain-integration 2023-12-21 23:50:01 +01:00
Max Deichmann
61cf9b1f19 first commit 2023-12-21 23:49:33 +01:00
Graham Neubig
2362544344
Update the request_str 2023-12-21 09:58:06 -05:00
Krrish Dholakia
1a32228da5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
14115d0d60 feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
be68796eba fix(router.py): add support for async image generation endpoints 2023-12-21 14:38:44 +05:30
Krrish Dholakia
81078c4004 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4f41c3c513 fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
Krrish Dholakia
812f9ca1b3 fix(azure.py): correctly raise async exceptions 2023-12-21 12:23:07 +05:30
Krrish Dholakia
87fca1808a test(test_amazing_vertex_completion.py): fix project name 2023-12-21 12:14:34 +05:30
Krish Dholakia
5edc987209
Merge pull request #1199 from neubig/add_safety_settings_default
Add a default for safety settings in vertex AI
2023-12-21 09:07:23 +05:30
ishaan-jaff
3e5cfee1f4 (ci/cd) run again 2023-12-21 07:25:22 +05:30
ishaan-jaff
b701a356cc (fix) vertex ai auth file 2023-12-21 07:22:25 +05:30
Krrish Dholakia
6795f0447a fix(utils.py): fix non_default_param pop error for ollama 2023-12-21 06:59:13 +05:30
David Manouchehri
93c4556eb0
Add aws_bedrock_runtime_endpoint support. 2023-12-20 19:31:43 -05:00
Graham Neubig
6e9267ca66 Make vertex_chat work with generate_content 2023-12-20 15:32:44 -05:00
Mateo Cámara
b72d372aa7 feat: added explicit args to acomplete 2023-12-20 19:49:12 +01:00
Graham Neubig
482b3b5bc3 Add a default for safety settings in vertex AI 2023-12-20 13:12:50 -05:00
Krrish Dholakia
04bbd0649f fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version

n
2023-12-20 19:10:59 +05:30
Krrish Dholakia
350389f501 fix(utils.py): add support for anyscale function calling 2023-12-20 17:48:33 +05:30
Krrish Dholakia
4040f60feb feat(router.py): support async image generation on router 2023-12-20 17:24:20 +05:30
Krrish Dholakia
f355e03515 feat(main.py): add async image generation support 2023-12-20 16:58:40 +05:30
Krrish Dholakia
f59b9436be feat(main.py): add async image generation support 2023-12-20 16:58:15 +05:30
Krrish Dholakia
b3962e483f feat(azure.py): add support for azure image generations endpoint 2023-12-20 16:37:21 +05:30
AllentDan
6b19db0327
fix least_busy router by updating min_traffic 2023-12-20 18:16:00 +08:00
Krrish Dholakia
f0df28362a feat(ollama.py): add support for ollama function calling 2023-12-20 14:59:55 +05:30
ishaan-jaff
683a1ee979 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
7ad21de441 (feat) proxy /key/generate add metadata to _types 2023-12-20 13:42:49 +05:30
ishaan-jaff
c4b7ab6579 (feat) proxy - add metadata for keys 2023-12-20 13:42:49 +05:30
Krish Dholakia
93c4efd715
Merge pull request #1190 from neubig/add_vertexai_safety_settings
Add partial support of VertexAI safety settings
2023-12-20 07:31:10 +00:00
ishaan-jaff
229b56fc35 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
Graham Neubig
2d15e5384b Add partial support of vertexai safety settings 2023-12-19 22:26:55 -05:00
ishaan-jaff
aa78415894 (docs) swager - add embeddings tag 2023-12-20 06:29:36 +05:30