Commit graph

5297 commits

Author SHA1 Message Date
Krrish Dholakia
57607f111a fix(ollama.py): use litellm.request timeout for async call timeout 2023-12-22 11:22:24 +05:30
Krrish Dholakia
6ade2c74b5 bump: version 1.15.4 → 1.15.5 2023-12-22 11:08:54 +05:30
Krrish Dholakia
278f61f3ed fix(utils.py): handle 'os.environ/' being passed in as kwargs 2023-12-22 11:08:44 +05:30
Max Deichmann
1c68f5557d
Merge branch 'main' into improve-langchain-integration 2023-12-21 23:50:01 +01:00
Max Deichmann
61cf9b1f19 first commit 2023-12-21 23:49:33 +01:00
Krrish Dholakia
d87e59db25 bump: version 1.15.3 → 1.15.4 2023-12-21 21:20:23 +05:30
Graham Neubig
2362544344
Update the request_str 2023-12-21 09:58:06 -05:00
Krrish Dholakia
1a32228da5 feat(proxy_server.py): support max budget on proxy 2023-12-21 16:07:20 +05:30
Krrish Dholakia
14115d0d60 feat(proxy_server.py): add new images/generation endpoint 2023-12-21 15:39:09 +05:30
Krrish Dholakia
8101ad6801 bump: version 1.15.2 → 1.15.3 2023-12-21 14:39:29 +05:30
Krrish Dholakia
be68796eba fix(router.py): add support for async image generation endpoints 2023-12-21 14:38:44 +05:30
Krrish Dholakia
a4aa645cf6 bump: version 1.15.1 → 1.15.2 2023-12-21 13:23:00 +05:30
Krrish Dholakia
81078c4004 fix(proxy/utils.py): jsonify object before db writes 2023-12-21 13:03:14 +05:30
Sihyeon Jang
4f41c3c513 fix: success_callback logic for cost_tracking
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
Krrish Dholakia
812f9ca1b3 fix(azure.py): correctly raise async exceptions 2023-12-21 12:23:07 +05:30
Krrish Dholakia
87fca1808a test(test_amazing_vertex_completion.py): fix project name 2023-12-21 12:14:34 +05:30
Krish Dholakia
5edc987209
Merge pull request #1199 from neubig/add_safety_settings_default
Add a default for safety settings in vertex AI
2023-12-21 09:07:23 +05:30
Krrish Dholakia
97f6475035 docs(health.md): add docs on health checks for embedding models 2023-12-21 07:54:04 +05:30
ishaan-jaff
f6407aaf74 (docs) add metadata keys/generate 2023-12-21 07:43:53 +05:30
ishaan-jaff
fbab7371dc (docs) proxy - virtual keys 2023-12-21 07:36:32 +05:30
Ishaan Jaff
d54de58e31
Merge pull request #1197 from Rested/main
docker build and push on release
2023-12-21 07:28:17 +05:30
ishaan-jaff
3e5cfee1f4 (ci/cd) run again 2023-12-21 07:25:22 +05:30
ishaan-jaff
b701a356cc (fix) vertex ai auth file 2023-12-21 07:22:25 +05:30
Krrish Dholakia
6795f0447a fix(utils.py): fix non_default_param pop error for ollama 2023-12-21 06:59:13 +05:30
David Manouchehri
93c4556eb0
Add aws_bedrock_runtime_endpoint support. 2023-12-20 19:31:43 -05:00
Graham Neubig
6e9267ca66 Make vertex_chat work with generate_content 2023-12-20 15:32:44 -05:00
Graham Neubig
482b3b5bc3 Add a default for safety settings in vertex AI 2023-12-20 13:12:50 -05:00
Krrish Dholakia
04bbd0649f fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version

n
2023-12-20 19:10:59 +05:30
Reuben Thomas-Davis
fe4427907d 🗑️ remove unused docker workflow for clarity 2023-12-20 12:41:23 +00:00
Reuben Thomas-Davis
50af89e853 💚 docker build and push on release
when a github release is published a docker image is pushed to ghcr avoiding manual workflow dispatch method (but still making it available as a fallback)
2023-12-20 12:40:43 +00:00
Krrish Dholakia
350389f501 fix(utils.py): add support for anyscale function calling 2023-12-20 17:48:33 +05:30
Krrish Dholakia
4040f60feb feat(router.py): support async image generation on router 2023-12-20 17:24:20 +05:30
Krrish Dholakia
f355e03515 feat(main.py): add async image generation support 2023-12-20 16:58:40 +05:30
Krrish Dholakia
f59b9436be feat(main.py): add async image generation support 2023-12-20 16:58:15 +05:30
Krrish Dholakia
b3962e483f feat(azure.py): add support for azure image generations endpoint 2023-12-20 16:37:21 +05:30
AllentDan
6b19db0327
fix least_busy router by updating min_traffic 2023-12-20 18:16:00 +08:00
Krrish Dholakia
f0df28362a feat(ollama.py): add support for ollama function calling 2023-12-20 14:59:55 +05:30
ishaan-jaff
bab8f3350d (docs) openrouter 2023-12-20 13:42:49 +05:30
ishaan-jaff
7d20ea23d1 (docs) set openrouter params 2023-12-20 13:42:49 +05:30
ishaan-jaff
683a1ee979 (feat) proxy key/generate pass metadata in requests 2023-12-20 13:42:49 +05:30
ishaan-jaff
7ad21de441 (feat) proxy /key/generate add metadata to _types 2023-12-20 13:42:49 +05:30
ishaan-jaff
c4b7ab6579 (feat) proxy - add metadata for keys 2023-12-20 13:42:49 +05:30
Krish Dholakia
93c4efd715
Merge pull request #1190 from neubig/add_vertexai_safety_settings
Add partial support of VertexAI safety settings
2023-12-20 07:31:10 +00:00
Ishaan Jaff
9fa2fe0c37
Merge pull request #1185 from navidre/prevent_key_log_documentation
Sample code to prevent logging API key in callback to Slack
2023-12-20 09:09:10 +05:30
ishaan-jaff
229b56fc35 (docs) swagger - add embedding tag 2023-12-20 09:04:56 +05:30
Graham Neubig
2d15e5384b Add partial support of vertexai safety settings 2023-12-19 22:26:55 -05:00
ishaan-jaff
aa78415894 (docs) swager - add embeddings tag 2023-12-20 06:29:36 +05:30
ishaan-jaff
9548334e2f (docs) swagger docs add description 2023-12-20 06:27:26 +05:30
ishaan-jaff
8b26e64b5d (fix) proxy: add link t swagger docs on startup 2023-12-20 06:02:05 +05:30
ishaan-jaff
cd34b859df (docs) swagger endpoint 2023-12-20 05:49:45 +05:30