Krrish Dholakia
57607f111a
fix(ollama.py): use litellm.request timeout for async call timeout
2023-12-22 11:22:24 +05:30
Krrish Dholakia
6ade2c74b5
bump: version 1.15.4 → 1.15.5
2023-12-22 11:08:54 +05:30
Krrish Dholakia
278f61f3ed
fix(utils.py): handle 'os.environ/' being passed in as kwargs
2023-12-22 11:08:44 +05:30
Max Deichmann
1c68f5557d
Merge branch 'main' into improve-langchain-integration
2023-12-21 23:50:01 +01:00
Max Deichmann
61cf9b1f19
first commit
2023-12-21 23:49:33 +01:00
Krrish Dholakia
d87e59db25
bump: version 1.15.3 → 1.15.4
2023-12-21 21:20:23 +05:30
Graham Neubig
2362544344
Update the request_str
2023-12-21 09:58:06 -05:00
Krrish Dholakia
1a32228da5
feat(proxy_server.py): support max budget on proxy
2023-12-21 16:07:20 +05:30
Krrish Dholakia
14115d0d60
feat(proxy_server.py): add new images/generation endpoint
2023-12-21 15:39:09 +05:30
Krrish Dholakia
8101ad6801
bump: version 1.15.2 → 1.15.3
2023-12-21 14:39:29 +05:30
Krrish Dholakia
be68796eba
fix(router.py): add support for async image generation endpoints
2023-12-21 14:38:44 +05:30
Krrish Dholakia
a4aa645cf6
bump: version 1.15.1 → 1.15.2
2023-12-21 13:23:00 +05:30
Krrish Dholakia
81078c4004
fix(proxy/utils.py): jsonify object before db writes
2023-12-21 13:03:14 +05:30
Sihyeon Jang
4f41c3c513
fix: success_callback logic for cost_tracking
...
Signed-off-by: Sihyeon Jang <uneedsihyeon@gmail.com>
2023-12-21 16:09:59 +09:00
Krrish Dholakia
812f9ca1b3
fix(azure.py): correctly raise async exceptions
2023-12-21 12:23:07 +05:30
Krrish Dholakia
87fca1808a
test(test_amazing_vertex_completion.py): fix project name
2023-12-21 12:14:34 +05:30
Krish Dholakia
5edc987209
Merge pull request #1199 from neubig/add_safety_settings_default
...
Add a default for safety settings in vertex AI
2023-12-21 09:07:23 +05:30
Krrish Dholakia
97f6475035
docs(health.md): add docs on health checks for embedding models
2023-12-21 07:54:04 +05:30
ishaan-jaff
f6407aaf74
(docs) add metadata keys/generate
2023-12-21 07:43:53 +05:30
ishaan-jaff
fbab7371dc
(docs) proxy - virtual keys
2023-12-21 07:36:32 +05:30
Ishaan Jaff
d54de58e31
Merge pull request #1197 from Rested/main
...
docker build and push on release
2023-12-21 07:28:17 +05:30
ishaan-jaff
3e5cfee1f4
(ci/cd) run again
2023-12-21 07:25:22 +05:30
ishaan-jaff
b701a356cc
(fix) vertex ai auth file
2023-12-21 07:22:25 +05:30
Krrish Dholakia
6795f0447a
fix(utils.py): fix non_default_param pop error for ollama
2023-12-21 06:59:13 +05:30
David Manouchehri
93c4556eb0
Add aws_bedrock_runtime_endpoint support.
2023-12-20 19:31:43 -05:00
Graham Neubig
6e9267ca66
Make vertex_chat work with generate_content
2023-12-20 15:32:44 -05:00
Graham Neubig
482b3b5bc3
Add a default for safety settings in vertex AI
2023-12-20 13:12:50 -05:00
Krrish Dholakia
04bbd0649f
fix(router.py): only do sync image gen fallbacks for now
...
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version
n
2023-12-20 19:10:59 +05:30
Reuben Thomas-Davis
fe4427907d
🗑️ remove unused docker workflow for clarity
2023-12-20 12:41:23 +00:00
Reuben Thomas-Davis
50af89e853
💚 docker build and push on release
...
when a github release is published a docker image is pushed to ghcr avoiding manual workflow dispatch method (but still making it available as a fallback)
2023-12-20 12:40:43 +00:00
Krrish Dholakia
350389f501
fix(utils.py): add support for anyscale function calling
2023-12-20 17:48:33 +05:30
Krrish Dholakia
4040f60feb
feat(router.py): support async image generation on router
2023-12-20 17:24:20 +05:30
Krrish Dholakia
f355e03515
feat(main.py): add async image generation support
2023-12-20 16:58:40 +05:30
Krrish Dholakia
f59b9436be
feat(main.py): add async image generation support
2023-12-20 16:58:15 +05:30
Krrish Dholakia
b3962e483f
feat(azure.py): add support for azure image generations endpoint
2023-12-20 16:37:21 +05:30
AllentDan
6b19db0327
fix least_busy router by updating min_traffic
2023-12-20 18:16:00 +08:00
Krrish Dholakia
f0df28362a
feat(ollama.py): add support for ollama function calling
2023-12-20 14:59:55 +05:30
ishaan-jaff
bab8f3350d
(docs) openrouter
2023-12-20 13:42:49 +05:30
ishaan-jaff
7d20ea23d1
(docs) set openrouter params
2023-12-20 13:42:49 +05:30
ishaan-jaff
683a1ee979
(feat) proxy key/generate pass metadata in requests
2023-12-20 13:42:49 +05:30
ishaan-jaff
7ad21de441
(feat) proxy /key/generate add metadata to _types
2023-12-20 13:42:49 +05:30
ishaan-jaff
c4b7ab6579
(feat) proxy - add metadata for keys
2023-12-20 13:42:49 +05:30
Krish Dholakia
93c4efd715
Merge pull request #1190 from neubig/add_vertexai_safety_settings
...
Add partial support of VertexAI safety settings
2023-12-20 07:31:10 +00:00
Ishaan Jaff
9fa2fe0c37
Merge pull request #1185 from navidre/prevent_key_log_documentation
...
Sample code to prevent logging API key in callback to Slack
2023-12-20 09:09:10 +05:30
ishaan-jaff
229b56fc35
(docs) swagger - add embedding tag
2023-12-20 09:04:56 +05:30
Graham Neubig
2d15e5384b
Add partial support of vertexai safety settings
2023-12-19 22:26:55 -05:00
ishaan-jaff
aa78415894
(docs) swager - add embeddings tag
2023-12-20 06:29:36 +05:30
ishaan-jaff
9548334e2f
(docs) swagger docs add description
2023-12-20 06:27:26 +05:30
ishaan-jaff
8b26e64b5d
(fix) proxy: add link t swagger docs on startup
2023-12-20 06:02:05 +05:30
ishaan-jaff
cd34b859df
(docs) swagger endpoint
2023-12-20 05:49:45 +05:30