Commit graph

1828 commits

Author SHA1 Message Date
Krish Dholakia
7f64c61275 Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting
feat(router.py): allow setting model_region in litellm_params
2024-05-11 11:36:22 -07:00
Krrish Dholakia
9e0f5f4086 docs(reliability.md): add region based routing to proxy + sdk docs 2024-05-11 11:34:12 -07:00
Krish Dholakia
8ab9c861c9 Merge pull request #3369 from mogith-pn/main
Clarifai-LiteLLM : Added clarifai as LLM Provider.
2024-05-11 09:31:46 -07:00
Krish Dholakia
30117317a4 Merge branch 'main' into feat/add-azure-content-filter 2024-05-11 09:30:38 -07:00
Ishaan Jaff
b02f633cd6 Merge pull request #3577 from BerriAI/litellm_add_triton_server
[Feat] Add Triton Embeddings to LiteLLM
2024-05-10 19:20:23 -07:00
Ishaan Jaff
9c3064e3b9 docs triton embeddings support 2024-05-10 19:08:15 -07:00
Krish Dholakia
859d978a77 Merge pull request #3571 from BerriAI/litellm_hf_classifier_support
Huggingface classifier support
2024-05-10 17:54:27 -07:00
Krrish Dholakia
2e14cad180 docs(enterprise.md): add aws marketplace notice on docs 2024-05-10 15:54:29 -07:00
Krrish Dholakia
1b5a9a5c80 docs(huggingface.md): add text-classification to huggingface docs 2024-05-10 14:39:14 -07:00
Krrish Dholakia
65f8cf030c docs(predibase.md): add support for predibase to docs 2024-05-10 10:58:35 -07:00
Antonio Loison
fbc3832117 deps: remove diskcache from dependencies and add install in docs 2024-05-10 12:34:05 +02:00
Antonio Loison
59c8cf4a4f docs: add disk cache doc and update cache arguments 2024-05-10 12:17:03 +02:00
Krrish Dholakia
c83743d3af docs(customer_routing.md): add region-based routing for specific customers, to docs 2024-05-09 18:40:49 -07:00
Kade Heckel
dbd063169a Updated docs for Ollama. 2024-05-09 12:29:01 +01:00
Ishaan Jaff
88d0180d1f docs include stream_options param 2024-05-08 21:57:25 -07:00
Krrish Dholakia
b96a6b1c97 docs(hosted.md): add feature list 2024-05-08 09:53:13 -07:00
Krrish Dholakia
9e7094cae3 docs(users.md): simplify doc with end-user vs. internal user 2024-05-08 08:34:40 -07:00
Krrish Dholakia
91bb097136 docs(routing.md): make clear lowest cost routing is async 2024-05-07 21:34:18 -07:00
Ishaan Jaff
470f8f6de9 Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack
[Feat] litellm.Router / litellm.completion -  send llm exceptions to slack
2024-05-07 19:09:44 -07:00
Ishaan Jaff
792863b433 docs setup alerting on router 2024-05-07 18:26:45 -07:00
Ishaan Jaff
aa1f912f6a Merge pull request #3503 from paul-gauthier/deepseek
Added "deepseek/" as a supported provider (openai compatible)
2024-05-07 15:15:47 -07:00
Ishaan Jaff
faea40886a docs - lowest cost routing 2024-05-07 13:15:30 -07:00
Ishaan Jaff
33d0f131a4 docs - lowest - latency routing 2024-05-07 12:43:44 -07:00
Paul Gauthier
c72e7e85e2 Added support for the deepseek api 2024-05-07 11:44:03 -07:00
Krish Dholakia
c2d8ee194d Merge pull request #3500 from ghaemisr/main
Added support for JWT auth with PEM cert public keys
2024-05-07 11:07:30 -07:00
Ishaan Jaff
b52232bdda Merge pull request #3464 from sepiatone/main
update langchain documentation to reflect refactor
2024-05-07 08:42:23 -07:00
Ishaan Jaff
2e4321b88e Merge pull request #3471 from paneru-rajan/Issue-3466-update-doc
Synced the doc with the Mistral
2024-05-07 08:41:18 -07:00
Sara Ghaemi
8706dd220f made audience optional and updated docs 2024-05-07 11:37:04 -04:00
sepiatone
597faceb5f update langchain documentation to reflect refactor 2024-05-07 17:01:55 +05:30
Krish Dholakia
72736599db Merge pull request #3459 from alexanderepstein/langfuse_improvements
Update support for langfuse metadata
2024-05-06 21:56:29 -07:00
Ishaan Jaff
f5ab34e246 docs - deploy litellm on gcp cloud run 2024-05-06 08:10:35 -07:00
Ishaan Jaff
0b069a314f docs - add using vertex embedding models 2024-05-06 07:56:17 -07:00
Rajan Paneru
3425aa7c43 bring back the to be depricated mistral-medium 2024-05-07 00:19:19 +09:30
Rajan Paneru
03ada978f3 Synced the model with the Mistral
* Ordered them based on previous order
* Excluded to be depricated models
* Use the model name, instead of the slug in the first column as referred by the name of the column
2024-05-06 13:16:24 +09:30
Alex Epstein
9ae08a8344 Update support for langfuse metadata
- Added ability to set trace release, version, metadata
- Added ability to update fields during a trace continuation
- Added ability to update input and output during a trace continuation
- Wrote new test for verifying metadata is set correctly
- Small improvement to setting secret boolean, prevent unnecessary literal_eval
- Small improvements to langfuse tests
2024-05-05 14:56:36 -04:00
Ishaan Jaff
b4358cc007 docs - set retry policy 2024-05-04 17:52:01 -07:00
Ishaan Jaff
1a96bbea64 Merge pull request #1530 from TanaroSch/main
change max_tokens type to int
2024-05-04 12:47:15 -07:00
alisalim17
f24d0aec4b docs: add documentation for logfire integration 2024-05-04 17:47:54 +04:00
Rajan Paneru
22ac1976c2 Improve the document of Traceloop
If we follow exact step I was getting two errors, which this Pull-Request will fix:
* Fixed the yaml snippet spacing issue: it was throwing error when running copy+paste code
* Added api_key: my-fake-key as to run litellm --config config.yaml --debug it will be required, otherwise it will fail

Need for Improvements:
* The traceloop is ambigous, because it's a company that maintains OpenLLMetry, and has observability solution and the sdk name is aslo traceloop-sdk
* The doc was missing several other observability solutions
* The steps were not quite obvious, added one more steps to make things clear
2024-05-04 22:42:53 +09:30
Lunik
ebb0ac4579 ✏️ doc: typo in azure content safety
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:45:15 +02:00
Lunik
d69a1eeb4f 📝 doc: Azure content safety Proxy usage
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:39:43 +02:00
Krrish Dholakia
001e7976da docs(exception_mapping.md): update exception mapping docs with 'should_retry' 2024-05-03 16:51:26 -07:00
Ishaan Jaff
d281d74067 docs - titan embeddings v2 2024-05-03 15:59:02 -07:00
Ishaan Jaff
532866a92d docs - simplify prod docs 2024-05-03 15:40:05 -07:00
Krish Dholakia
7e04447159 Merge pull request #3393 from Priva28/main
Add Llama3 tokenizer and allow custom tokenizers.
2024-05-02 16:32:41 -07:00
greenscale-nandesh
02f0c6bfc8 Merge branch 'main' into main 2024-05-02 08:22:11 -07:00
Christian Privitelli
f461f6b746 update docs, allow use with encode/decode 2024-05-02 14:56:38 +10:00
Krrish Dholakia
0b1382aef4 build(ui): enable adding openmeter via proxy ui 2024-05-01 21:16:23 -07:00
Krrish Dholakia
49a97cbf6b docs(openmeter.md): add openmeter to docs 2024-05-01 18:31:45 -07:00
Krrish Dholakia
846e971f5e docs(deploy.md): cleanup docker quick start docs 2024-05-01 10:00:49 -07:00