Commit graph

2876 commits

Author SHA1 Message Date
Krrish Dholakia
fc33429628 bump: version 1.15.5 → 1.15.6 2023-12-22 12:24:20 +05:30
Krrish Dholakia
f5ffea471d fix(utils.py): handle ollama yielding a dict 2023-12-22 12:23:42 +05:30
Krrish Dholakia
a65dfdde94 test(test_completion.py-+-test_streaming.py): add ollama endpoint to ci/cd pipeline 2023-12-22 12:21:33 +05:30
Krrish Dholakia
ae288c97fb fix(ollama.py): use litellm.request timeout for async call timeout 2023-12-22 11:22:24 +05:30
Krrish Dholakia
e3d486efe2 fix(utils.py): handle 'os.environ/' being passed in as kwargs 2023-12-22 11:08:44 +05:30
Max Deichmann
15ccf02f1e Merge branch 'main' into improve-langchain-integration 2023-12-21 23:50:01 +01:00
Max Deichmann
7f98b53434 first commit 2023-12-21 23:49:33 +01:00
Graham Neubig
23b7a8a5ac Update the request_str 2023-12-21 09:58:06 -05:00
Krrish Dholakia
aefaa157e5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
158325497d feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
c084f04a35 fix(router.py): add support for async image generation endpoints 2023-12-21 14:38:44 +05:30
Krrish Dholakia
fc935f0972 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4cbcda607c fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
Krrish Dholakia
ee2a467708 fix(azure.py): correctly raise async exceptions 2023-12-21 12:23:07 +05:30
Krrish Dholakia
18a981b624 test(test_amazing_vertex_completion.py): fix project name 2023-12-21 12:14:34 +05:30
Krish Dholakia
4b354f699a Merge pull request #1199 from neubig/add_safety_settings_default
Add a default for safety settings in vertex AI
2023-12-21 09:07:23 +05:30
ishaan-jaff
fba2370341 (ci/cd) run again 2023-12-21 07:25:22 +05:30
ishaan-jaff
8112397a3b (fix) vertex ai auth file 2023-12-21 07:22:25 +05:30
Krrish Dholakia
8c7d62e62d fix(utils.py): fix non_default_param pop error for ollama 2023-12-21 06:59:13 +05:30
David Manouchehri
98bb605701 Add aws_bedrock_runtime_endpoint support. 2023-12-20 19:31:43 -05:00
Graham Neubig
b630528ef3 Make vertex_chat work with generate_content 2023-12-20 15:32:44 -05:00
Graham Neubig
c6b803ffaa Add a default for safety settings in vertex AI 2023-12-20 13:12:50 -05:00
Krrish Dholakia
b873833340 fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version

n
2023-12-20 19:10:59 +05:30
Krrish Dholakia
77b11daf28 fix(utils.py): add support for anyscale function calling 2023-12-20 17:48:33 +05:30
Krrish Dholakia
89b808d767 feat(router.py): support async image generation on router 2023-12-20 17:24:20 +05:30
Krrish Dholakia
a8f997eceb feat(main.py): add async image generation support 2023-12-20 16:58:40 +05:30
Krrish Dholakia
a14d51e074 feat(main.py): add async image generation support 2023-12-20 16:58:15 +05:30
Krrish Dholakia
23d0278739 feat(azure.py): add support for azure image generations endpoint 2023-12-20 16:37:21 +05:30
AllentDan
4b84f0f049 fix least_busy router by updating min_traffic 2023-12-20 18:16:00 +08:00
Krrish Dholakia
636ac9b605 feat(ollama.py): add support for ollama function calling 2023-12-20 14:59:55 +05:30
ishaan-jaff
36f42294c8 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
b0df8789c5 (feat) proxy /key/generate add metadata to _types 2023-12-20 13:42:49 +05:30
ishaan-jaff
ffda18e551 (feat) proxy - add metadata for keys 2023-12-20 13:42:49 +05:30
Krish Dholakia
e9e5731c19 Merge pull request #1190 from neubig/add_vertexai_safety_settings
Add partial support of VertexAI safety settings
2023-12-20 07:31:10 +00:00
ishaan-jaff
c2c6b7cc31 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
Graham Neubig
efb4af930b Add partial support of vertexai safety settings 2023-12-19 22:26:55 -05:00
ishaan-jaff
935e83d445 (docs) swager - add embeddings tag 2023-12-20 06:29:36 +05:30
ishaan-jaff
84f77bfaf8 (docs) swagger docs add description 2023-12-20 06:27:26 +05:30
ishaan-jaff
343e192914 (fix) proxy: add link t swagger docs on startup 2023-12-20 06:02:05 +05:30
Krrish Dholakia
b0300392b9 fix(utils.py): vertex ai exception mapping 2023-12-19 15:25:29 +00:00
Krrish Dholakia
40a9d62de9 fix(ollama.py): raise async errors 2023-12-19 15:01:12 +00:00
ishaan-jaff
3c37e0d58b (fix) proxy + ollama - raise exception correctly 2023-12-19 18:48:34 +05:30
Max Deichmann
a250b4d421 debug langfuse on env variable 2023-12-19 12:20:58 +01:00
Max Deichmann
00ff0113d0 add log 2023-12-19 10:06:03 +01:00
Max Deichmann
a43b8aa703 add users 2023-12-19 09:54:51 +01:00
Max Deichmann
a49e1d662e changes 2023-12-19 09:41:49 +01:00
sumanth
64dfd0d9a6 fix 2023-12-19 13:15:48 +05:30
ishaan-jaff
8cdcdda4c1 (fix) proxy - health checks support cli model 2023-12-19 12:55:20 +05:30
Krish Dholakia
7e3f9d344c Merge branch 'main' into main 2023-12-18 17:54:34 -08:00
Krrish Dholakia
00826e195e fix(router.py): init deployment_latency_map even if model_list is empty 2023-12-18 17:50:26 -08:00