Krish Dholakia
11f85d883f
LiteLLM Minor Fixes + Improvements ( #5474 )
...
* feat(proxy/_types.py): add lago billing to callbacks ui
Closes https://github.com/BerriAI/litellm/issues/5472
* fix(anthropic.py): return anthropic prompt caching information
Fixes https://github.com/BerriAI/litellm/issues/5364
* feat(bedrock/chat.py): support 'json_schema' for bedrock models
Closes https://github.com/BerriAI/litellm/issues/5434
* fix(bedrock/embed/embeddings.py): support async embeddings for amazon titan models
* fix: linting fixes
* fix: handle key errors
* fix(bedrock/chat.py): fix bedrock ai21 streaming object
* feat(bedrock/embed): support bedrock embedding optional params
* fix(databricks.py): fix usage chunk
* fix(internal_user_endpoints.py): apply internal user defaults, if user role updated
Fixes issue where user update wouldn't apply defaults
* feat(slack_alerting.py): provide multiple slack channels for a given alert type
multiple channels might be interested in receiving an alert for a given type
* docs(alerting.md): add multiple channel alerting to docs
2024-09-02 14:29:57 -07:00
Krish Dholakia
e474c3665a
Bedrock Embeddings refactor + model support ( #5462 )
...
* refactor(bedrock): initial commit to refactor bedrock to a folder
Improve code readability + maintainability
* refactor: more refactor work
* fix: fix imports
* feat(bedrock/embeddings.py): support translating embedding into amazon embedding formats
* fix: fix linting errors
* test: skip test on end of life model
* fix(cohere/embed.py): fix linting error
* fix(cohere/embed.py): fix typing
* fix(cohere/embed.py): fix post-call logging for cohere embedding call
* test(test_embeddings.py): fix error message assertion in test
2024-09-01 13:29:58 -07:00
Krish Dholakia
996c028127
Merge pull request #5393 from BerriAI/litellm_gemini_embedding_support
...
feat(vertex_ai_and_google_ai_studio): Support Google AI Studio Embedding Endpoint
2024-08-28 13:46:28 -07:00
Krrish Dholakia
cf5768cd0c
test(test_embeddings.py): fix test
2024-08-28 07:51:00 -07:00
Krrish Dholakia
57330d2d0d
feat(batch_embed_content_transformation.py): support google ai studio /batchEmbedContent endpoint
...
Allows for multiple strings to be given for embedding
2024-08-27 19:23:50 -07:00
Krrish Dholakia
4bb59b7b2c
feat(embeddings_handler.py): support async gemini embeddings
2024-08-27 18:31:57 -07:00
Krrish Dholakia
6a483a1908
fix(embeddings_handler.py): initial working commit for google ai studio text embeddings /embedContent endpoint
2024-08-27 18:14:56 -07:00
Krrish Dholakia
882ae8ac2c
fix: initial commit
2024-08-27 17:35:56 -07:00
Ishaan Jaff
fac610beab
add testing for cohere embeddings
2024-08-09 12:08:25 -07:00
Krrish Dholakia
cac91dcae4
fix(huggingface_restapi.py): fix hf embeddings optional param processing
2024-08-09 09:10:56 -07:00
Krrish Dholakia
d382de7b74
fix(huggingface_restapi.py): fixes issue where 'wait_for_model' was not being passed as expected
2024-08-09 08:36:35 -07:00
Krish Dholakia
ee375edcbf
Merge branch 'main' into litellm_async_cohere_calls
2024-07-30 15:35:20 -07:00
Krrish Dholakia
3a92468765
fix(cohere.py): support async cohere embedding calls
2024-07-30 14:49:07 -07:00
Krrish Dholakia
17ac0f0636
feat(huggingface_restapi.py): Support multiple hf embedding types + async hf embeddings
...
Closes https://github.com/BerriAI/litellm/issues/3261
2024-07-30 13:32:03 -07:00
Krrish Dholakia
dd429386b0
test: cleanup testing
2024-07-24 19:47:50 -07:00
Krrish Dholakia
d3953ac2ae
test(test_embedding.py): add simple azure embedding ad token test
...
Addresses https://github.com/BerriAI/litellm/issues/4859#issuecomment-2248838617
2024-07-24 13:38:03 -07:00
David Manouchehri
85f62f066a
(test_embedding.py) - Re-enable embedding test with Azure OIDC.
2024-07-24 16:41:24 +00:00
David Manouchehri
49645a057c
(tests) - Skip embedding Azure AD test for now.
2024-07-24 15:42:57 +00:00
Krrish Dholakia
2cddd09374
test(test_embedding.py): fix base url
2024-07-24 08:04:27 -07:00
David Manouchehri
592017acc9
(test - azure): Add test for Azure OIDC auth.
2024-07-23 19:12:40 +00:00
Ishaan Jaff
9ddd9d628a
fix bedrock embedding test
2024-07-20 20:05:22 -07:00
Ishaan Jaff
c6dec45107
fix triton embedding test
2024-07-17 17:29:22 -07:00
Krrish Dholakia
bad847043f
docs(supported_embeddings.md): add doc on provider-specific params for embedding models
2024-07-09 12:39:10 -07:00
Simon Sanchez Viloria
f93ece722d
(test - watsonx) use MagicMock to mock httpx.AsyncClient endpoint for aembedding test
2024-07-07 18:55:42 +02:00
Simon Sanchez Viloria
711e5a2dc2
(test - watsonx) Added tests for watsonx embeddings with mocked endpoints
2024-07-07 17:59:37 +02:00
Krrish Dholakia
e3c5e004c5
feat(databricks.py): add embedding model support
2024-05-23 18:22:03 -07:00
Krrish Dholakia
87a21115c5
test: handle watsonx rate limit error
2024-05-13 18:27:39 -07:00
Ishaan Jaff
fd1aa6f6f9
test triton embeddings
2024-05-10 18:50:34 -07:00
Krrish Dholakia
692a5c6aa6
test: skip local test
2024-04-27 19:07:49 -07:00
Simon Sanchez Viloria
2dd79eb30e
(test) Added completion and embedding tests for watsonx provider
2024-04-24 12:52:29 +02:00
Ishaan Jaff
91ec3478e1
test - voyage ai embedding
2024-04-03 20:54:35 -07:00
Krish Dholakia
09269005db
Merge pull request #2142 from vilmar-hillow/azure_embedding_ad_token
...
Fixed azure ad token not being processed properly in embedding models
2024-03-19 11:51:28 -07:00
Krrish Dholakia
235787ff67
fix(cohere.py): return usage as a pydantic object not dict
2024-03-15 10:00:22 -07:00
Dmitry Supranovich
f69ce1a6cf
Fixed azure ad token not being processed properly in embedding models
2024-03-12 21:29:24 -04:00
Krrish Dholakia
7f4dd734c1
fix(vertex_ai.py): correctly parse optional params and pass vertex ai project
2024-03-06 14:00:50 -08:00
Krrish Dholakia
dad65ca602
fix(bedrock.py): support anthropic messages api on bedrock (claude-3)
2024-03-04 17:15:47 -08:00
Krrish Dholakia
7457e03223
test: skip aws test - aws account suspended
2024-02-28 14:42:50 -08:00
Krrish Dholakia
129e84f0c5
test: removing aws tests - account suspended - pending their approval
2024-02-28 13:46:20 -08:00
Krrish Dholakia
7e2c7259e6
fix(vertex_ai.py): add async embedding support for vertex ai
2024-02-03 10:35:17 -08:00
Krrish Dholakia
d7ae8a2203
fix(test_embedding.py): fix test
2024-02-03 09:49:23 -08:00
Krrish Dholakia
3f23b18dad
feat(vertex_ai.py): vertex ai gecko text embedding support
2024-02-03 09:48:29 -08:00
ishaan-jaff
ff6459cf8a
(test) bedrock input validation - exceptions
2024-01-30 08:12:43 -08:00
ishaan-jaff
6da3a46670
(test) dimension param - openai
2024-01-26 10:37:01 -08:00
ishaan-jaff
303a8c1ca8
(chore) cleanup testing file
2024-01-25 14:36:11 -08:00
ishaan-jaff
95f34a31fa
(test) embedding models
2024-01-25 14:30:49 -08:00
Krrish Dholakia
17e65996de
fix(main.py): support custom pricing for embedding calls
2024-01-22 15:15:34 -08:00
ishaan-jaff
d5cfe5cefa
(test) aembedding includes custom_llm_provider
2024-01-12 17:52:14 -08:00
ishaan-jaff
f3a78ac0ea
(test) embedding hidden params
2024-01-12 17:35:33 -08:00
ishaan-jaff
eb8b00b2a5
(test) caching for bedrock/embedding str inputs
2024-01-11 23:12:57 +05:30
ishaan-jaff
b4665c80e5
(test) bedrock - embedding with strings
2024-01-11 23:04:41 +05:30