litellm-mirror

mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 11:14:04 +00:00

Author	SHA1	Message	Date
Krrish Dholakia	c83743d3af	docs(customer_routing.md): add region-based routing for specific customers, to docs	2024-05-09 18:40:49 -07:00
Ishaan Jaff	88d0180d1f	docs include `stream_options` param	2024-05-08 21:57:25 -07:00
Krrish Dholakia	b96a6b1c97	docs(hosted.md): add feature list	2024-05-08 09:53:13 -07:00
Krrish Dholakia	9e7094cae3	docs(users.md): simplify doc with end-user vs. internal user	2024-05-08 08:34:40 -07:00
Krrish Dholakia	91bb097136	docs(routing.md): make clear lowest cost routing is async	2024-05-07 21:34:18 -07:00
Ishaan Jaff	470f8f6de9	Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack [Feat] litellm.Router / litellm.completion - send llm exceptions to slack	2024-05-07 19:09:44 -07:00
Ishaan Jaff	792863b433	docs setup alerting on router	2024-05-07 18:26:45 -07:00
Ishaan Jaff	aa1f912f6a	Merge pull request #3503 from paul-gauthier/deepseek Added "deepseek/" as a supported provider (openai compatible)	2024-05-07 15:15:47 -07:00
Ishaan Jaff	faea40886a	docs - lowest cost routing	2024-05-07 13:15:30 -07:00
Ishaan Jaff	33d0f131a4	docs - lowest - latency routing	2024-05-07 12:43:44 -07:00
Paul Gauthier	c72e7e85e2	Added support for the deepseek api	2024-05-07 11:44:03 -07:00
Krish Dholakia	c2d8ee194d	Merge pull request #3500 from ghaemisr/main Added support for JWT auth with PEM cert public keys	2024-05-07 11:07:30 -07:00
Ishaan Jaff	b52232bdda	Merge pull request #3464 from sepiatone/main update langchain documentation to reflect refactor	2024-05-07 08:42:23 -07:00
Ishaan Jaff	2e4321b88e	Merge pull request #3471 from paneru-rajan/Issue-3466-update-doc Synced the doc with the Mistral	2024-05-07 08:41:18 -07:00
Sara Ghaemi	8706dd220f	made audience optional and updated docs	2024-05-07 11:37:04 -04:00
sepiatone	597faceb5f	update langchain documentation to reflect refactor	2024-05-07 17:01:55 +05:30
Krish Dholakia	72736599db	Merge pull request #3459 from alexanderepstein/langfuse_improvements Update support for langfuse metadata	2024-05-06 21:56:29 -07:00
Ishaan Jaff	f5ab34e246	docs - deploy litellm on gcp cloud run	2024-05-06 08:10:35 -07:00
Ishaan Jaff	0b069a314f	docs - add using vertex embedding models	2024-05-06 07:56:17 -07:00
Rajan Paneru	3425aa7c43	bring back the to be depricated mistral-medium	2024-05-07 00:19:19 +09:30
Rajan Paneru	03ada978f3	Synced the model with the Mistral * Ordered them based on previous order * Excluded to be depricated models * Use the model name, instead of the slug in the first column as referred by the name of the column	2024-05-06 13:16:24 +09:30
Alex Epstein	9ae08a8344	Update support for langfuse metadata - Added ability to set trace release, version, metadata - Added ability to update fields during a trace continuation - Added ability to update input and output during a trace continuation - Wrote new test for verifying metadata is set correctly - Small improvement to setting secret boolean, prevent unnecessary literal_eval - Small improvements to langfuse tests	2024-05-05 14:56:36 -04:00
Ishaan Jaff	b4358cc007	docs - set retry policy	2024-05-04 17:52:01 -07:00
Ishaan Jaff	1a96bbea64	Merge pull request #1530 from TanaroSch/main change max_tokens type to int	2024-05-04 12:47:15 -07:00
Rajan Paneru	22ac1976c2	Improve the document of Traceloop If we follow exact step I was getting two errors, which this Pull-Request will fix: * Fixed the yaml snippet spacing issue: it was throwing error when running copy+paste code * Added api_key: my-fake-key as to run litellm --config config.yaml --debug it will be required, otherwise it will fail Need for Improvements: * The traceloop is ambigous, because it's a company that maintains OpenLLMetry, and has observability solution and the sdk name is aslo traceloop-sdk * The doc was missing several other observability solutions * The steps were not quite obvious, added one more steps to make things clear	2024-05-04 22:42:53 +09:30
Krrish Dholakia	001e7976da	docs(exception_mapping.md): update exception mapping docs with 'should_retry'	2024-05-03 16:51:26 -07:00
Ishaan Jaff	d281d74067	docs - titan embeddings v2	2024-05-03 15:59:02 -07:00
Ishaan Jaff	532866a92d	docs - simplify prod docs	2024-05-03 15:40:05 -07:00
Krish Dholakia	7e04447159	Merge pull request #3393 from Priva28/main Add Llama3 tokenizer and allow custom tokenizers.	2024-05-02 16:32:41 -07:00
greenscale-nandesh	02f0c6bfc8	Merge branch 'main' into main	2024-05-02 08:22:11 -07:00
Christian Privitelli	f461f6b746	update docs, allow use with encode/decode	2024-05-02 14:56:38 +10:00
Krrish Dholakia	0b1382aef4	build(ui): enable adding openmeter via proxy ui	2024-05-01 21:16:23 -07:00
Krrish Dholakia	49a97cbf6b	docs(openmeter.md): add openmeter to docs	2024-05-01 18:31:45 -07:00
Krrish Dholakia	846e971f5e	docs(deploy.md): cleanup docker quick start docs	2024-05-01 10:00:49 -07:00
Krrish Dholakia	faa139ff4c	docs(routing.md): add docs on lowest latency routing buffer	2024-04-30 22:41:50 -07:00
Ishaan Jaff	63e5b8764e	docs - slack alerting	2024-04-29 21:33:03 -07:00
Ishaan Jaff	a13fb2244e	docs example logging to langfuse	2024-04-29 21:26:27 -07:00
Ishaan Jaff	bf01cc535b	docs logging to langfuse on proxy	2024-04-29 21:26:15 -07:00
Ishaan Jaff	d3d62387f0	docs - fix kub.yaml config on docs	2024-04-29 21:20:29 -07:00
Krrish Dholakia	701be55872	feat(utils.py): json logs for raw request sent by litellm make it easier to view verbose logs in datadog	2024-04-29 19:21:19 -07:00
Krrish Dholakia	a048bcf88f	docs(langfuse_integration.md): add 'existing_trace_id' to langfuse docs	2024-04-29 16:40:38 -07:00
Krrish Dholakia	4f9e8d5aa1	docs(load_test.md): cleanup docs	2024-04-29 16:27:58 -07:00
Krrish Dholakia	963a59a786	docs(load_test.md): add multi-instance router load test to docs	2024-04-29 16:25:56 -07:00
Krrish Dholakia	e11a92c735	docs(load_test.md): simplify doc	2024-04-29 16:00:02 -07:00
Krrish Dholakia	0bae8911f8	docs(load_test.md): formatting	2024-04-29 15:58:41 -07:00
Krrish Dholakia	5b56a0856e	docs(load_test.md): load test multiple instances of the proxy w/ tpm/rpm limits on deployments	2024-04-29 15:58:14 -07:00
Krrish Dholakia	8ba4b73573	docs(routing.md): add max parallel requests to router docs	2024-04-29 15:37:48 -07:00
Ishaan Jaff	5dc911cba9	docs - track cost custom callbacks	2024-04-29 13:15:08 -07:00
Krrish Dholakia	27f76e263e	docs(vllm.md): update docs to tell people to check openai-compatible endpoint docs for vllm	2024-04-28 09:48:03 -07:00
Krish Dholakia	7ed64709aa	Merge branch 'main' into litellm_common_auth_params	2024-04-28 08:38:06 -07:00

1 2 3 4 5 ...

1710 commits