Krrish Dholakia | e2d275f1b7 | fix(utils.py): add exception mapping for bedrock image internal server error | 2024-07-19 19:30:41 -07:00
Krrish Dholakia | 4de19cb116 | test(test_router_timeout.py): fix test | 2024-07-19 19:00:55 -07:00
Krrish Dholakia | 335cf65452 | fix(anthropic.py): fix streaming client | 2024-07-19 18:55:00 -07:00
Krrish Dholakia | 9e098326d0 | test: skip duplicate test | 2024-07-19 18:51:50 -07:00
Krrish Dholakia | ccb8035949 | fix(batches/main.py): fix linting error | 2024-07-19 18:26:13 -07:00
Krrish Dholakia | 99aa311083 | fix(user_api_key_auth.py): update team values in token cache if refreshed more recently | 2024-07-19 17:35:59 -07:00
Krish Dholakia | 614e292bed | Merge pull request #4788 from BerriAI/litellm_ai21_jamba: feat(bedrock_httpx.py): add ai21 jamba instruct as bedrock model | 2024-07-19 17:11:36 -07:00
Krrish Dholakia | 35e640076b | fix(user_api_key_auth.py): update valid token cache with updated team object cache | 2024-07-19 17:06:49 -07:00
Ishaan Jaff | f7b1c33208 | fix typing errors | 2024-07-19 16:43:31 -07:00
Ishaan Jaff | d3e7910c4d | Merge pull request #4798 from BerriAI/litellm_fix_health_check: fix health check - make sure one failing deployment does not stop the health check | 2024-07-19 16:41:12 -07:00
Ishaan Jaff | f3ac6493e8 | fix typing errors | 2024-07-19 16:18:53 -07:00
Ishaan Jaff | 43e5890f77 | fix health check | 2024-07-19 15:56:35 -07:00
Krrish Dholakia | fa7037e48a | fix(files/main.py): fix linting error | 2024-07-19 15:50:25 -07:00
Krrish Dholakia | bd914c714c | fix(files/main.py): fix linting error | 2024-07-19 15:48:22 -07:00
Krrish Dholakia | 757dedd4c8 | fix(factory.py): refactor factory to use httpx client | 2024-07-19 15:35:05 -07:00
Krrish Dholakia | 4c4f032a75 | fix(factory.py): refactor factory to use httpx client | 2024-07-19 15:32:01 -07:00
Krish Dholakia | f973bd8109 | Merge branch 'main' into litellm_ai21_jamba | 2024-07-19 15:11:55 -07:00
Ishaan Jaff | 1797021d53 | ci/cd run again | 2024-07-19 14:05:22 -07:00
Ishaan Jaff | af80e271c9 | ui new build | 2024-07-19 14:02:28 -07:00
Ishaan Jaff | df1ac92222 | fix add fix to update spend logs | 2024-07-19 12:49:23 -07:00
Ishaan Jaff | 90824e2d21 | ui new build | 2024-07-19 11:00:13 -07:00
Ishaan Jaff | ae316d2d9a | fix ui - make default session 24 hours | 2024-07-19 10:17:45 -07:00
Ishaan Jaff | defc341f39 | docs fireworks ai | 2024-07-19 10:08:48 -07:00
Ishaan Jaff | 2e766a7b1f | ci/cd run again | 2024-07-19 08:25:56 -07:00
Sophia Loris | d779253949 | resolve merge conflicts | 2024-07-19 09:45:53 -05:00
Sophia Loris | d5c65c6be2 | Add support for Triton streaming & triton async completions | 2024-07-19 09:35:27 -05:00
Krish Dholakia | 2ad1c0455a | Merge pull request #4773 from titusz/patch-1: Add missing `num_gpu` ollama configuration parameter | 2024-07-18 22:47:39 -07:00
Krrish Dholakia | c56456be64 | fix(anthropic.py): revert client to requests library | 2024-07-18 22:45:41 -07:00
Marc Abramowitz | 780a6293dc | Alias /health/liveliness as /health/liveness. The latter is the more common term in Kubernetes, so it's nice to support that. | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | 6f393be66b | docs - tag based routing | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | d9c051adff | add tags to metadata | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | fa26d3f96f | fix test | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | 1ab5c1a227 | check if using tag based routing | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | 52d0f6a808 | control using enable_tag_filtering | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | e298515034 | fix use tags as a litellm param | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | 79c5788ad9 | fix remove previous code on free/paid tier | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | ad46e6a61f | router - refactor to tag based routing | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | 38c50e674e | fix ui make ui session last 24 hours | 2024-07-18 22:40:51 -07:00
Krrish Dholakia | db3063d3d3 | test: fix test | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | d42963a0ae | router - use free paid tier routing | 2024-07-18 22:40:51 -07:00
Ishaan Jaff | b0f0898f2f | helper to get_deployments_for_tier | 2024-07-18 22:40:51 -07:00
Krrish Dholakia | f2401d6d5e | feat(vertex_ai_anthropic.py): support response_schema for vertex ai anthropic calls. allows passing response_schema for anthropic calls. supports schema validation. | 2024-07-18 22:40:35 -07:00
Ishaan Jaff | 154fa64d0a | Merge pull request #4781 from msabramo/liveness-alias-of-liveliness: Alias `/health/liveliness` as `/health/liveness` | 2024-07-18 22:20:09 -07:00
Ishaan Jaff | f04397e19a | Merge pull request #4789 from BerriAI/litellm_router_refactor: [Feat-Router] - Tag based routing | 2024-07-18 22:19:18 -07:00
Ishaan Jaff | 4dff995932 | Merge pull request #4787 from BerriAI/litellm_increase_default_session_time_ui: [Fix] Admin UI - make ui session last 12 hours | 2024-07-18 22:19:09 -07:00
Ishaan Jaff | a4338bec11 | docs - tag based routing | 2024-07-18 22:18:10 -07:00
Krrish Dholakia | a0ac3e3c7d | test: fix test | 2024-07-18 22:05:47 -07:00
Ishaan Jaff | 502b739b33 | add tags to metadata | 2024-07-18 21:55:53 -07:00
Ishaan Jaff | 331e2bbc17 | fix test | 2024-07-18 21:49:36 -07:00
Krish Dholakia | 967964a51c | Merge branch 'main' into litellm_anthropic_response_schema_support | 2024-07-18 20:40:16 -07:00