litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-27 03:34:10 +00:00

Author	SHA1	Message	Date
Ishaan Jaff	852132c0c7	fix handle user message	2024-09-12 14:34:32 -07:00
Ishaan Jaff	4e365443f2	fix linting	2024-09-12 14:15:18 -07:00
Ishaan Jaff	5fe95f7232	fix handle o1 not supporting system message	2024-09-12 14:09:13 -07:00
Ishaan Jaff	d845842071	add o1 reasoning tests	2024-09-12 13:40:15 -07:00
Ishaan Jaff	f71083bdad	add OpenAI o1 config	2024-09-12 13:22:59 -07:00
Krish Dholakia	7f47c48b35	LiteLLM Minor Fixes and Improvements (09/10/2024) (#5618 ) * fix(cost_calculator.py): move to debug for noisy warning message on cost calculation error Fixes https://github.com/BerriAI/litellm/issues/5610 * fix(databricks/cost_calculator.py): Handles model name issues for databricks models * fix(main.py): fix stream chunk builder for multiple tool calls Fixes https://github.com/BerriAI/litellm/issues/5591 * fix: correctly set user_alias when passed in Fixes https://github.com/BerriAI/litellm/issues/5612 * fix(types/utils.py): allow passing role for message object https://github.com/BerriAI/litellm/issues/5621 * fix(litellm_logging.py): Fix langfuse logging across multiple projects Fixes issue where langfuse logger was re-using the old logging object * feat(proxy/_types.py): support adding key-based tags for tag-based routing Enable tag based routing at key-level * fix(proxy/_types.py): fix inheritance * test(test_key_generate_prisma.py): fix test * test: fix test * fix(litellm_logging.py): return used callback object	2024-09-11 11:30:29 -07:00
Ishaan Jaff	4515f43976	Merge pull request #5623 from BerriAI/litellm_vertex_use_async_for_getting_token [Feat-Vertex Perf] Use async func to get auth credentials	2024-09-10 18:53:48 -07:00
Ishaan Jaff	cfe084c4f5	add doc string to vertex llm base	2024-09-10 18:52:43 -07:00
Ishaan Jaff	dbc85c3dfe	fix gemini streaming test	2024-09-10 17:50:24 -07:00
Ishaan Jaff	14a7b9d7c1	fix gemini test	2024-09-10 17:20:01 -07:00
Ishaan Jaff	c8fe600dbf	fix case when gemini is used	2024-09-10 17:06:45 -07:00
Ishaan Jaff	7891b3742c	fix vertex use async func to set auth creds	2024-09-10 16:12:18 -07:00
Ishaan Jaff	b7e11bb22e	Merge pull request #5622 from BerriAI/litellm_fix_auth_refresh_vertex [Feat-Perf Improvement Vertex] Only Refresh credentials when token is expired	2024-09-10 15:03:35 -07:00
Ishaan Jaff	d35e35bab6	fix bedrock get async client	2024-09-10 14:17:18 -07:00
Ishaan Jaff	3ac9711524	fix types for vertex project id	2024-09-10 14:02:15 -07:00
Ishaan Jaff	571490d0f2	fix getting params	2024-09-10 13:54:42 -07:00
Ishaan Jaff	0008c6fe6c	fix vertex only refresh auth when required	2024-09-10 13:49:28 -07:00
Ishaan Jaff	d0af5abdbf	fix linting error	2024-09-10 13:29:02 -07:00
Ishaan Jaff	5b72e385f1	fix get_async_httpx_client	2024-09-10 13:20:55 -07:00
Ishaan Jaff	5b06e54ea7	fix test	2024-09-10 13:15:50 -07:00
Ishaan Jaff	0d6081e370	pass llm provider when creating async httpx clients	2024-09-10 11:51:42 -07:00
Ishaan Jaff	93c1db4a79	rename get_async_httpx_client	2024-09-10 10:38:01 -07:00
Ishaan Jaff	21c462cf56	fix vertex ai use _get_async_client	2024-09-10 10:33:19 -07:00
Krish Dholakia	09ca581620	LiteLLM Minor Fixes and Improvements (09/09/2024) (#5602 ) * fix(main.py): pass default azure api version as alternative in completion call Fixes api error caused due to api version Closes https://github.com/BerriAI/litellm/issues/5584 * Fixed gemini-1.5-flash pricing (#5590) * add /key/list endpoint * bump: version 1.44.21 → 1.44.22 * docs architecture * Fixed gemini-1.5-flash pricing --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * fix(bedrock/chat.py): fix converse api stop sequence param mapping Fixes https://github.com/BerriAI/litellm/issues/5592 * fix(databricks/cost_calculator.py): handle databricks model name changes Fixes https://github.com/BerriAI/litellm/issues/5597 * fix(azure.py): support azure api version 2024-08-01-preview Closes https://github.com/BerriAI/litellm/issues/5377 * fix(proxy/_types.py): allow dev keys to call cohere /rerank endpoint Fixes issue where only admin could call rerank endpoint * fix(azure.py): check if model is gpt-4o * fix(proxy/_types.py): support /v1/rerank on non-admin routes as well * fix(cost_calculator.py): fix split on `/` logic in cost calculator --------- Co-authored-by: F1bos <44951186+F1bos@users.noreply.github.com> Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-09 21:56:12 -07:00
Krish Dholakia	52849e6422	LiteLLM Minor Fixes and Improvements (09/07/2024) (#5580 ) * fix(litellm_logging.py): set completion_start_time_float to end_time_float if none Fixes https://github.com/BerriAI/litellm/issues/5500 * feat(_init_.py): add new 'openai_text_completion_compatible_providers' list Fixes https://github.com/BerriAI/litellm/issues/5558 Handles correctly routing fireworks ai calls when done via text completions * fix: fix linting errors * fix: fix linting errors * fix(openai.py): fix exception raised * fix(openai.py): fix error handling * fix(_redis.py): allow all supported arguments for redis cluster (#5554) * Revert "fix(_redis.py): allow all supported arguments for redis cluster (#5554)" (#5583) This reverts commit `f2191ef4cb`. * fix(router.py): return model alias w/ underlying deployment on router.get_model_list() Fixes https://github.com/BerriAI/litellm/issues/5524#issuecomment-2336410666 * test: handle flaky tests --------- Co-authored-by: Jonas Dittrich <58814480+Kakadus@users.noreply.github.com>	2024-09-09 18:54:17 -07:00
Krish Dholakia	2cab33b061	LiteLLM Minor Fixes and Improvements (08/06/2024) (#5567 ) * fix(utils.py): return citations for perplexity streaming Fixes https://github.com/BerriAI/litellm/issues/5535 * fix(anthropic/chat.py): support fallbacks for anthropic streaming (#5542) * fix(anthropic/chat.py): support fallbacks for anthropic streaming Fixes https://github.com/BerriAI/litellm/issues/5512 * fix(anthropic/chat.py): use module level http client if none given (prevents early client closure) * fix: fix linting errors * fix(http_handler.py): fix raise_for_status error handling * test: retry flaky test * fix otel type * fix(bedrock/embed): fix error raising * test(test_openai_batches_and_files.py): skip azure batches test (for now) quota exceeded * fix(test_router.py): skip azure batch route test (for now) - hit batch quota limits --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com> * All `model_group_alias` should show up in `/models`, `/model/info` , `/model_group/info` (#5539) * fix(router.py): support returning model_alias model names in `/v1/models` * fix(proxy_server.py): support returning model alias'es on `/model/info` * feat(router.py): support returning model group alias for `/model_group/info` * fix(proxy_server.py): fix linting errors * fix(proxy_server.py): fix linting errors * build(model_prices_and_context_window.json): add amazon titan text premier pricing information Closes https://github.com/BerriAI/litellm/issues/5560 * feat(litellm_logging.py): log standard logging response object for pass through endpoints. Allows bedrock /invoke agent calls to be correctly logged to langfuse + s3 * fix(success_handler.py): fix linting error * fix(success_handler.py): fix linting errors * fix(team_endpoints.py): Allows admin to update team member budgets --------- Co-authored-by: Ishaan Jaff <ishaanjaffer0324@gmail.com>	2024-09-06 17:16:24 -07:00
Krish Dholakia	c910a32439	LiteLLM Minor Fixes and Improvements (#5537 ) * fix(vertex_ai): Fixes issue where multimodal message without text was failing vertex calls Fixes https://github.com/BerriAI/litellm/issues/5515 * fix(azure.py): move to using httphandler for oidc token calls Fixes issue where ssl certificates weren't being picked up as expected Closes https://github.com/BerriAI/litellm/issues/5522 * feat: Allows admin to set a default_max_internal_user_budget in config, and allow setting more specific values as env vars * fix(proxy_server.py): fix read for max_internal_user_budget * build(model_prices_and_context_window.json): add regional gpt-4o-2024-08-06 pricing Closes https://github.com/BerriAI/litellm/issues/5540 * test: skip re-test	2024-09-05 18:21:42 -07:00
Krish Dholakia	a1fdf9d6d4	LiteLLM Merged PR's (#5538 ) * Fix typo in #5509 (#5532) * Reapply "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519) This reverts commit `995019c08a`. * (bedrock): Fix obvious typo * test: cleanup linting error --------- Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>	2024-09-05 18:21:42 -07:00
Ishaan Jaff	e2b848ed12	fix import	2024-09-05 14:42:56 -07:00
Ishaan Jaff	bb801f084d	fix import	2024-09-05 10:41:13 -07:00
Ishaan Jaff	6f68e860e0	fix import error	2024-09-05 10:09:44 -07:00
Ishaan Jaff	7370a994f5	use correct type hints for audio transcriptions	2024-09-05 09:12:27 -07:00
Krish Dholakia	6fdee99632	LiteLLM Minor fixes + improvements (08/04/2024) (#5505 ) * Minor IAM AWS OIDC Improvements (#5246) * AWS IAM: Temporary tokens are valid across all regions after being issued, so it is wasteful to request one for each region. * AWS IAM: Include an inline policy, to help reduce misuse of overly permissive IAM roles. * (test_bedrock_completion.py): Ensure we are testing cross AWS region OIDC flow. * fix(router.py): log rejected requests Fixes https://github.com/BerriAI/litellm/issues/5498 * refactor: don't use verbose_logger.exception, if exception is raised User might already have handling for this. But alerting systems in prod will raise this as an unhandled error. * fix(datadog.py): support setting datadog source as an env var Fixes https://github.com/BerriAI/litellm/issues/5508 * docs(logging.md): add dd_source to datadog docs * fix(proxy_server.py): expose `/customer/list` endpoint for showing all customers * (bedrock): Fix usage with Cloudflare AI Gateway, and proxies in general. (#5509) * feat(anthropic.py): support 'cache_control' param for content when it is a string * Revert "(bedrock): Fix usage with Cloudflare AI Gateway, and proxies in gener…" (#5519) This reverts commit `3fac0349c2`. * refactor: ci/cd run again --------- Co-authored-by: David Manouchehri <david.manouchehri@ai.moda>	2024-09-04 22:16:55 -07:00
Krish Dholakia	0595d03116	security - Prevent sql injection in `/team/update` query (#5513 ) * fix(team_endpoints.py): replace `.get_data()` usage with prisma interface Prevent sql injection in `/team/update` query Fixes https://huntr.com/bounties/a4f6d357-5b44-4e00-9cac-f1cc351211d2 * fix(vertex_ai_non_gemini.py): handle message being a pydantic model	2024-09-04 16:03:02 -07:00
Krish Dholakia	8eb7cb5300	LiteLLM Minor fixes + improvements (08/03/2024) (#5488 ) * fix(internal_user_endpoints.py): set budget_reset_at for /user/update * fix(vertex_and_google_ai_studio_gemini.py): handle accumulated json Fixes https://github.com/BerriAI/litellm/issues/5479 * fix(vertex_ai_and_gemini.py): fix assistant message function call when content is not None Fixes https://github.com/BerriAI/litellm/issues/5490 * fix(proxy_server.py): generic state uuid for okta sso * fix(lago.py): improve debug logs Debugging for https://github.com/BerriAI/litellm/issues/5477 * docs(bedrock.md): add bedrock cross-region inferencing to docs * fix(azure.py): return azure response headers on aembedding call * feat(azure.py): return azure response headers for `/audio/transcription` * fix(types/utils.py): standardize deepseek / anthropic prompt caching usage information Closes https://github.com/BerriAI/litellm/issues/5285 * docs(usage.md): add docs on litellm usage object * test(test_completion.py): mark flaky test	2024-09-03 21:21:34 -07:00
Ishaan Jaff	09519b74db	refactor get_secret	2024-09-03 10:42:12 -07:00
Ishaan Jaff	9c14d63697	Merge branch 'main' into litellm_track_imagen_spend_logs	2024-09-02 21:21:15 -07:00
Ishaan Jaff	778cba702e	fix linting errors	2024-09-02 19:39:10 -07:00
Ishaan Jaff	e3becc6514	refactor vtx image gen	2024-09-02 17:35:51 -07:00
Ishaan Jaff	e60c7a3b85	track /embedding in spendLogs	2024-09-02 17:05:53 -07:00
Ishaan Jaff	dc1b0ec182	Merge pull request #5478 from BerriAI/litellm_Add_ai21 [Feat] Add AI21 /chat API	2024-09-02 16:20:37 -07:00
Krish Dholakia	11f85d883f	LiteLLM Minor Fixes + Improvements (#5474 ) * feat(proxy/_types.py): add lago billing to callbacks ui Closes https://github.com/BerriAI/litellm/issues/5472 * fix(anthropic.py): return anthropic prompt caching information Fixes https://github.com/BerriAI/litellm/issues/5364 * feat(bedrock/chat.py): support 'json_schema' for bedrock models Closes https://github.com/BerriAI/litellm/issues/5434 * fix(bedrock/embed/embeddings.py): support async embeddings for amazon titan models * fix: linting fixes * fix: handle key errors * fix(bedrock/chat.py): fix bedrock ai21 streaming object * feat(bedrock/embed): support bedrock embedding optional params * fix(databricks.py): fix usage chunk * fix(internal_user_endpoints.py): apply internal user defaults, if user role updated Fixes issue where user update wouldn't apply defaults * feat(slack_alerting.py): provide multiple slack channels for a given alert type multiple channels might be interested in receiving an alert for a given type * docs(alerting.md): add multiple channel alerting to docs	2024-09-02 14:29:57 -07:00
Ishaan Jaff	e1dacde1ec	add all ai21 params	2024-09-02 11:54:40 -07:00
Ishaan Jaff	a33206c939	refactor ai21	2024-09-02 11:47:04 -07:00
David Manouchehri	6375affeea	(gemini): Fix Cloudflare AI Gateway typo. (#5429 )	2024-09-02 07:52:18 -07:00
Simon S. Viloria	1c9a82771a	fix response_format={'type': 'json_object'} not working for Azure models (#5468 )	2024-09-01 13:31:13 -07:00
Krish Dholakia	e474c3665a	Bedrock Embeddings refactor + model support (#5462 ) * refactor(bedrock): initial commit to refactor bedrock to a folder Improve code readability + maintainability * refactor: more refactor work * fix: fix imports * feat(bedrock/embeddings.py): support translating embedding into amazon embedding formats * fix: fix linting errors * test: skip test on end of life model * fix(cohere/embed.py): fix linting error * fix(cohere/embed.py): fix typing * fix(cohere/embed.py): fix post-call logging for cohere embedding call * test(test_embeddings.py): fix error message assertion in test	2024-09-01 13:29:58 -07:00
Krish Dholakia	e12bd3e548	Minor LiteLLM Fixes and Improvements (#5456 ) * fix(utils.py): support 'drop_params' for embedding requests Fixes https://github.com/BerriAI/litellm/issues/5444 * feat(vertex_ai_non_gemini.py): support function param in messages * test: skip test - model end of life * fix(vertex_ai_non_gemini.py): fix gemini history parsing	2024-08-31 17:58:10 -07:00
Krish Dholakia	aa9f1896c6	anthropic prompt caching cost tracking (#5453 ) * fix(utils.py): support 'drop_params' for embedding requests Fixes https://github.com/BerriAI/litellm/issues/5444 * feat(anthropic/cost_calculation.py): Support calculating cost for prompt caching on anthropic * feat(types/utils.py): allows us to migrate to openai's equivalent, once that comes out * fix: fix linting errors * test: mark flaky test	2024-08-31 14:50:12 -07:00
Ishaan Jaff	ccbbb4f3f8	add cerebras config	2024-08-31 08:34:46 -07:00

1 2 3 4 5 ...

1672 commits