litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 11:43:54 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	3560f0ef2c	refactor: move all testing to top-level of repo Closes https://github.com/BerriAI/litellm/issues/486	2024-09-28 21:08:14 -07:00
Krish Dholakia	bd17424c4b	LiteLLM Minor Fixes & Improvements (09/26/2024) (#5925) (#5937) * LiteLLM Minor Fixes & Improvements (09/26/2024) (#5925) * fix(litellm_logging.py): don't initialize prometheus_logger if non premium user Prevents bad error messages in logs Fixes https://github.com/BerriAI/litellm/issues/5897 * Add Support for Custom Providers in Vision and Function Call Utils (#5688) * Add Support for Custom Providers in Vision and Function Call Utils Lookup * Remove parallel function call due to missing model info param * Add Unit Tests for Vision and Function Call Changes * fix-#5920: set header value to string to fix "'int' object has no att… (#5922) * LiteLLM Minor Fixes & Improvements (09/24/2024) (#5880) * LiteLLM Minor Fixes & Improvements (09/23/2024) (#5842) * feat(auth_utils.py): enable admin to allow client-side credentials to be passed Makes it easier for devs to experiment with finetuned fireworks ai models * feat(router.py): allow setting configurable_clientside_auth_params for a model Closes https://github.com/BerriAI/litellm/issues/5843 * build(model_prices_and_context_window.json): fix anthropic claude-3-5-sonnet max output token limit Fixes https://github.com/BerriAI/litellm/issues/5850 * fix(azure_ai/): support content list for azure ai Fixes https://github.com/BerriAI/litellm/issues/4237 * fix(litellm_logging.py): always set saved_cache_cost Set to 0 by default * fix(fireworks_ai/cost_calculator.py): add fireworks ai default pricing handles calling 405b+ size models * fix(slack_alerting.py): fix error alerting for failed spend tracking Fixes regression with slack alerting error monitoring * fix(vertex_and_google_ai_studio_gemini.py): handle gemini no candidates in streaming chunk error * docs(bedrock.md): add llama3-1 models * test: fix tests * fix(azure_ai/chat): fix transformation for azure ai calls * feat(azure_ai/embed): Add azure ai embeddings support Closes https://github.com/BerriAI/litellm/issues/5861 * fix(azure_ai/embed): enable async embedding * feat(azure_ai/embed): support azure ai multimodal embeddings * fix(azure_ai/embed): support async multi modal embeddings * feat(together_ai/embed): support together ai embedding calls * feat(rerank/main.py): log source documents for rerank endpoints to langfuse improves rerank endpoint logging * fix(langfuse.py): support logging `/audio/speech` input to langfuse * test(test_embedding.py): fix test * test(test_completion_cost.py): fix helper util * fix-#5920: set header value to string to fix "'int' object has no attribute 'encode'" --------- Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> * Revert "fix-#5920: set header value to string to fix "'int' object has no att…" (#5926) This reverts commit `a554ae2695`. * build(model_prices_and_context_window.json): add azure ai cohere rerank model pricing Enables cost tracking for azure ai cohere rerank models * fix(litellm_logging.py): fix debug log to be clearer Closes https://github.com/BerriAI/litellm/issues/5909 * test(test_utils.py): fix test name * fix(azure_ai/cost_calculator.py): support cost tracking for azure ai rerank models * fix(azure_ai): fix azure ai base model cost tracking for rerank endpoints * fix(converse_handler.py): support new llama 3-2 models Fixes https://github.com/BerriAI/litellm/issues/5901 * fix(litellm_logging.py): ensure response is redacted for standard message logging Fixes https://github.com/BerriAI/litellm/issues/5890#issuecomment-2378242360 * fix(cost_calculator.py): use 'get_model_info' for cohere rerank cost calculation allows user to set custom cost for model * fix(config.yml): fix docker hub auht * build(config.yml): add docker auth to all tests * fix(db/create_views.py): fix linting error * fix(main.py): fix circular import * fix(azure_ai/__init__.py): fix circular import * fix(main.py): fix import * fix: fix linting errors * test: fix test * fix(proxy_server.py): pass premium user value on startup used for prometheus init --------- Co-authored-by: Cole Murray <colemurray.cs@gmail.com> Co-authored-by: bravomark <62681807+bravomark@users.noreply.github.com> * handle streaming for azure ai studio error * [Perf Proxy] parallel request limiter - use one cache update call (#5932) * fix parallel request limiter - use one cache update call * ci/cd run again * run ci/cd again * use docker username password * fix config.yml * fix config * fix config * fix config.yml * ci/cd run again * use correct typing for batch set cache * fix async_set_cache_pipeline * fix only check user id tpm / rpm limits when limits set * fix test_openai_azure_embedding_with_oidc_and_cf * test: fix test * test(test_rerank.py): fix test --------- Co-authored-by: Cole Murray <colemurray.cs@gmail.com> Co-authored-by: bravomark <62681807+bravomark@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-27 17:54:13 -07:00
Krish Dholakia	da77706c26	Litellm stable dev (#5711) * feat(aws_base_llm.py): prevents recreating boto3 credentials during high traffic Leads to 100ms perf boost in local testing * fix(base_aws_llm.py): fix credential caching check to see if token is set * refactor(bedrock/chat): separate converse api and invoke api + isolate converse api transformation logic Make it easier to see how requests are transformed for /converse * fix: fix imports * fix(bedrock/embed): fix reordering of headers * fix(base_aws_llm.py): fix get credential logic * fix(converse_handler.py): fix ai21 streaming response	2024-09-14 23:22:59 -07:00
Krish Dholakia	4657a40ef1	LiteLLM Minor Fixes and Improvements (09/12/2024) (#5658) * fix(factory.py): handle tool call content as list Fixes https://github.com/BerriAI/litellm/issues/5652 * fix(factory.py): enforce stronger typing * fix(router.py): return model alias in /v1/model/info and /v1/model_group/info * fix(user_api_key_auth.py): move noisy warning message to debug cleanup logs * fix(types.py): cleanup pydantic v2 deprecated param Fixes https://github.com/BerriAI/litellm/issues/5649 * docs(gemini.md): show how to pass inline data to gemini api Fixes https://github.com/BerriAI/litellm/issues/5674	2024-09-12 23:04:06 -07:00
Krish Dholakia	1e7e538261	LiteLLM Minor fixes + improvements (08/04/2024) (#5505) * Minor IAM AWS OIDC Improvements (#5246) * AWS IAM: Temporary tokens are valid across all regions after being issued, so it is wasteful to request one for each region. * AWS IAM: Include an inline policy, to help reduce misuse of overly permissive IAM roles. * (test_bedrock_completion.py): Ensure we are testing cross AWS region OIDC flow. * fix(router.py): log rejected requests Fixes https://github.com/BerriAI/litellm/issues/5498 * refactor: don't use verbose_logger.exception, if exception is raised User might already have handling for this. But alerting systems in prod will raise this as an unhandled error. * fix(datadog.py): support setting datadog source as an env var Fixes https://github.com/BerriAI/litellm/issues/5508 * docs(logging.md): add dd_source to datadog docs * fix(proxy_server.py): expose `/customer/list` endpoint for showing all customers * (bedrock): Fix usage with Cloudflare AI Gateway, and proxies in general. (#5509) * feat(anthropic.py): support 'cache_control' param for content when it is a string * Revert "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519) This reverts commit `3fac0349c2`. * refactor: ci/cd run again --------- Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>	2024-09-04 22:16:55 -07:00
Krish Dholakia	be3c7b401e	LiteLLM Minor fixes + improvements (08/03/2024) (#5488) * fix(internal_user_endpoints.py): set budget_reset_at for /user/update * fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json Fixes https://github.com/BerriAI/litellm/issues/5479 * fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None Fixes https://github.com/BerriAI/litellm/issues/5490 * fix(proxy_server.py): generic state uuid for okta sso * fix(lago.py): improve debug logs Debugging for https://github.com/BerriAI/litellm/issues/5477 * docs(bedrock.md): add bedrock cross-region inferencing to docs * fix(azure.py): return azure response headers on aembedding call * feat(azure.py): return azure response headers for `/audio/transcription` * fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information Closes https://github.com/BerriAI/litellm/issues/5285 * docs(usage.md): add docs on litellm usage object * test(test_completion.py): mark flaky test	2024-09-03 21:21:34 -07:00
Krish Dholakia	37f9705d6e	Bedrock Embeddings refactor + model support (#5462) * refactor(bedrock): initial commit to refactor bedrock to a folder Improve code readability + maintainability * refactor: more refactor work * fix: fix imports * feat(bedrock/embeddings.py): support translating embedding into amazon embedding formats * fix: fix linting errors * test: skip test on end of life model * fix(cohere/embed.py): fix linting error * fix(cohere/embed.py): fix typing * fix(cohere/embed.py): fix post-call logging for cohere embedding call * test(test_embeddings.py): fix error message assertion in test	2024-09-01 13:29:58 -07:00
Krish Dholakia	dd7b008161	fix: Minor LiteLLM Fixes + Improvements (29/08/2024) (#5436) * fix(model_checks.py): support returning wildcard models on `/v1/models` Fixes https://github.com/BerriAI/litellm/issues/4903 * fix(bedrock_httpx.py): support calling bedrock via api_base Closes https://github.com/BerriAI/litellm/pull/4587 * fix(litellm_logging.py): only leave last 4 char of gemini key unmasked Fixes https://github.com/BerriAI/litellm/issues/5433 * feat(router.py): support setting 'weight' param for models on router Closes https://github.com/BerriAI/litellm/issues/5410 * test(test_bedrock_completion.py): add unit test for custom api base * fix(model_checks.py): handle no "/" in model	2024-08-29 22:40:25 -07:00
Krrish Dholakia	6431af0678	fix(bedrock_httpx.py): support 'Auth' header as extra_header Fixes https://github.com/BerriAI/litellm/issues/5389#issuecomment-2313677977	2024-08-27 16:08:54 -07:00
Krrish Dholakia	70bf8bd4f4	feat(factory.py): enable 'user_continue_message' for interweaving user/assistant messages when provider requires it allows bedrock to be used with autogen	2024-08-22 11:03:33 -07:00
Ishaan Jaff	89ba7b3e11	pass trace through for bedrock guardrails	2024-08-16 09:10:56 -07:00
Krrish Dholakia	c1279ed809	fix(bedrock_httpx.py): fix error code for not found provider/model combo to be 404	2024-08-13 20:36:12 -07:00
Krrish Dholakia	66d77f177f	fix(bedrock_httpx.py): raise bad request error if invalid bedrock model given	2024-08-13 19:27:06 -07:00
Krrish Dholakia	526b196f83	fix(bedrock_httpx.py): handle empty stop string	2024-08-13 07:30:30 -07:00
Krrish Dholakia	6e8d2856b0	fix(bedrock_httpx.py): handle bedrock empty system message	2024-08-13 07:17:17 -07:00
Ishaan Jaff	42617c207a	test bedrock tool call names	2024-08-09 17:14:56 -07:00
Ishaan Jaff	6dc9b39095	test invalid tool namehandling	2024-08-09 13:26:21 -07:00
David Manouchehri	507529e8df	(test_bedrock_completion.py) - Use FIPS endpoints for testing.	2024-07-31 16:51:58 +00:00
Ishaan Jaff	46555ab78b	test - bedrock guardrailConfig	2024-07-29 14:13:08 -07:00
Krrish Dholakia	e2d275f1b7	fix(utils.py): add exception mapping for bedrock image internal server error	2024-07-19 19:30:41 -07:00
Krrish Dholakia	0decc36bed	fix(factory.py): handle message content being a list instead of string Fixes https://github.com/BerriAI/litellm/issues/4679	2024-07-12 19:00:39 -07:00
Krrish Dholakia	88eb25da5c	fix(bedrock_httpx.py): handle user error - malformed system prompt if user passes in system prompt as a list of content blocks, handle that	2024-07-12 08:28:50 -07:00
Krrish Dholakia	79670ab82e	fix(main.py): get the region name from boto3 client if dynamic var not set	2024-07-02 09:24:07 -07:00
Ishaan Jaff	bad49a270d	fix test test_provisioned_throughput	2024-06-29 19:41:05 -07:00
Brian Schultheiss	632b7ce17d	Resolve merge conflicts	2024-06-29 15:53:02 -07:00
Krrish Dholakia	151d19960e	fix(bedrock_httpx.py): Fix https://github.com/BerriAI/litellm/issues/4415	2024-06-26 16:19:46 -07:00
Brian Schultheiss	09492cceba	Update tests to verify streaming works	2024-06-25 14:33:40 -07:00
Brian Schultheiss	5a6588342c	added test for change	2024-06-23 15:19:54 -07:00
David Manouchehri	02aaaf5976	Merge remote-tracking branch 'upstream/main' into oidc-bedrock-httpx-caching-part-1	2024-06-11 15:42:31 +00:00
Krrish Dholakia	e66b3d264f	fix(factory.py): handle bedrock claude image url's	2024-06-07 10:04:03 -07:00
David Manouchehri	3410367610	test(test_bedrock_completion.py): Add tests to ensure caching isn't breaking anything.	2024-06-01 15:22:08 +00:00
David Manouchehri	d70d484e10	Fix: Use David's AWS account to pass unit tests.	2024-05-31 13:50:49 +00:00
David Manouchehri	08ee4519b6	Add unit test for bedrock httpx oidc auth.	2024-05-31 12:44:53 +00:00
Krrish Dholakia	6b50e656b8	fix(main.py): pass extra headers through for async calls	2024-05-27 19:11:40 -07:00
Krrish Dholakia	24eb79da91	test(test_bedrock_completion.py): refactor test bedrock headers test	2024-05-27 19:01:07 -07:00
Krrish Dholakia	d2e14ca833	fix(bedrock_httpx.py): fix bedrock ptu model id str encoding Fixes https://github.com/BerriAI/litellm/issues/3805	2024-05-25 10:54:01 -07:00
Krrish Dholakia	00af8e350f	fix(bedrock_httpx.py): support bedrock ptu's Fixes https://github.com/BerriAI/litellm/issues/3805	2024-05-24 23:02:04 -07:00
Ishaan Jaff	0ddaf320ef	fix test - retry claude-3 image error 3 times	2024-05-20 16:17:09 -07:00
Ishaan Jaff	d77aea7253	Update test_bedrock_completion.py cc @Manouchehri - can u lmk what needs to be in our env to pass this test ? attaching the test log here: cda0de1d-3851-469c-8851-ef12dc27fab2/jobs/20819/tests#failed-test-0	2024-05-11 16:30:29 -07:00
David Manouchehri	3ee0328b04	feat(bedrock.py): Support using OIDC tokens.	2024-05-07 15:46:54 +00:00
Lucca Zenóbio	b22517845e	Merge branch 'main' into main	2024-05-06 09:40:23 -03:00
Krrish Dholakia	09d7121af2	fix(bedrock.py): map finish reason for bedrock	2024-05-04 12:45:40 -07:00
Lucca Zenobio	a9e2ef6212	test	2024-04-29 10:05:30 -03:00
Nilanjan De	5113d47023	add test	2024-04-19 00:42:48 +04:00
Ishaan Jaff	5393930701	fix function calling prompt - ask llm to respond in fahrenheit	2024-04-16 21:09:53 -07:00
Krrish Dholakia	2fabff06c0	fix(bedrock.py): fix supported openai params for bedrock claude 3	2024-03-23 16:02:15 -07:00
Krrish Dholakia	caa17d484a	fix(bedrock.py): working image calls to claude 3	2024-03-04 18:12:47 -08:00
Krrish Dholakia	818c29516d	fix(bedrock.py): support bedrock anthropic claude 3 tool calling	2024-03-04 17:47:28 -08:00
Krrish Dholakia	478307d4cf	fix(bedrock.py): support anthropic messages api on bedrock (claude-3)	2024-03-04 17:15:47 -08:00
Tim Xia	2321f19fe7	comment out tests	2024-03-01 23:28:25 -05:00


64 commits