Krrish Dholakia
7276c6eb1e
docs(token_auth.md): add end user cost tracking to jwt auth docs
2024-05-11 21:28:31 -07:00
Ishaan Jaff
2eb4508204
fix mark (BETA) Azure Content Safety
2024-05-11 17:51:21 -07:00
Ishaan Jaff
b9b8bf52f3
Merge pull request #3581 from BerriAI/litellm_log_metadata_langfuse_traces
...
[Feat] - log metadata on traces + allow users to log metadata when `existing_trace_id` exists
2024-05-11 14:19:48 -07:00
Ishaan Jaff
360d284058
docs - debug langfuse
2024-05-11 14:12:17 -07:00
Ishaan Jaff
bf909a89f8
Merge pull request #3585 from BerriAI/litellm_router_batch_comp
...
[Litellm Proxy + litellm.Router] - Pass the same message/prompt to N models
2024-05-11 13:51:45 -07:00
Ishaan Jaff
62276fc221
docs link to litellm batch completions
2024-05-11 13:45:32 -07:00
Ishaan Jaff
5918ee543b
docs - batch completion litellm proxy
2024-05-11 13:42:41 -07:00
Krish Dholakia
86d0c0ae4e
Merge pull request #3582 from BerriAI/litellm_explicit_region_name_setting
...
feat(router.py): allow setting model_region in litellm_params
2024-05-11 11:36:22 -07:00
Krrish Dholakia
0c87bb5adf
docs(reliability.md): add region based routing to proxy + sdk docs
2024-05-11 11:34:12 -07:00
Krish Dholakia
8f6ae9a059
Merge pull request #3369 from mogith-pn/main
...
Clarifai-LiteLLM : Added clarifai as LLM Provider.
2024-05-11 09:31:46 -07:00
Krish Dholakia
bbe1300c5b
Merge branch 'main' into feat/add-azure-content-filter
2024-05-11 09:30:38 -07:00
Ishaan Jaff
b09075da53
Merge pull request #3577 from BerriAI/litellm_add_triton_server
...
[Feat] Add Triton Embeddings to LiteLLM
2024-05-10 19:20:23 -07:00
Ishaan Jaff
b913620997
docs triton embeddings support
2024-05-10 19:08:15 -07:00
Krish Dholakia
1aa567f3b5
Merge pull request #3571 from BerriAI/litellm_hf_classifier_support
...
Huggingface classifier support
2024-05-10 17:54:27 -07:00
Krrish Dholakia
30d2df8940
docs(enterprise.md): add aws marketplace notice on docs
2024-05-10 15:54:29 -07:00
Krrish Dholakia
d4d175030f
docs(huggingface.md): add text-classification to huggingface docs
2024-05-10 14:39:14 -07:00
Krrish Dholakia
781d5888c3
docs(predibase.md): add support for predibase to docs
2024-05-10 10:58:35 -07:00
Krrish Dholakia
5a38438c3f
docs(customer_routing.md): add region-based routing for specific customers, to docs
2024-05-09 18:40:49 -07:00
Ishaan Jaff
e7e54772ae
docs include stream_options
param
2024-05-08 21:57:25 -07:00
Krrish Dholakia
c5897543c8
docs(hosted.md): add feature list
2024-05-08 09:53:13 -07:00
Krrish Dholakia
59080431b8
docs(users.md): simplify doc with end-user vs. internal user
2024-05-08 08:34:40 -07:00
Krrish Dholakia
67b4aa28bd
docs(routing.md): make clear lowest cost routing is async
2024-05-07 21:34:18 -07:00
Ishaan Jaff
223e386737
Merge pull request #3511 from BerriAI/litellm_router_send_exceptions_slack
...
[Feat] litellm.Router / litellm.completion - send llm exceptions to slack
2024-05-07 19:09:44 -07:00
Ishaan Jaff
d46544d2bc
docs setup alerting on router
2024-05-07 18:26:45 -07:00
Ishaan Jaff
17787db973
Merge pull request #3503 from paul-gauthier/deepseek
...
Added "deepseek/" as a supported provider (openai compatible)
2024-05-07 15:15:47 -07:00
Ishaan Jaff
d5f93048cc
docs - lowest cost routing
2024-05-07 13:15:30 -07:00
Ishaan Jaff
4c909194c7
docs - lowest - latency routing
2024-05-07 12:43:44 -07:00
Paul Gauthier
90eb0ea022
Added support for the deepseek api
2024-05-07 11:44:03 -07:00
Krish Dholakia
93e5fb49d3
Merge pull request #3500 from ghaemisr/main
...
Added support for JWT auth with PEM cert public keys
2024-05-07 11:07:30 -07:00
Ishaan Jaff
ee1b1fe4f8
Merge pull request #3464 from sepiatone/main
...
update langchain documentation to reflect refactor
2024-05-07 08:42:23 -07:00
Ishaan Jaff
8b15507add
Merge pull request #3471 from paneru-rajan/Issue-3466-update-doc
...
Synced the doc with the Mistral
2024-05-07 08:41:18 -07:00
Sara Ghaemi
66b2b5fab9
made audience optional and updated docs
2024-05-07 11:37:04 -04:00
sepiatone
1518a7b42d
update langchain documentation to reflect refactor
2024-05-07 17:01:55 +05:30
Krish Dholakia
30003afbf8
Merge pull request #3459 from alexanderepstein/langfuse_improvements
...
Update support for langfuse metadata
2024-05-06 21:56:29 -07:00
Ishaan Jaff
ec63a30095
docs - deploy litellm on gcp cloud run
2024-05-06 08:10:35 -07:00
Ishaan Jaff
e0001a9121
docs - add using vertex embedding models
2024-05-06 07:56:17 -07:00
Rajan Paneru
2347cb2247
bring back the to be depricated mistral-medium
2024-05-07 00:19:19 +09:30
Rajan Paneru
d0d063e25a
Synced the model with the Mistral
...
* Ordered them based on previous order
* Excluded to be depricated models
* Use the model name, instead of the slug in the first column as referred by the name of the column
2024-05-06 13:16:24 +09:30
Alex Epstein
b82162832a
Update support for langfuse metadata
...
- Added ability to set trace release, version, metadata
- Added ability to update fields during a trace continuation
- Added ability to update input and output during a trace continuation
- Wrote new test for verifying metadata is set correctly
- Small improvement to setting secret boolean, prevent unnecessary literal_eval
- Small improvements to langfuse tests
2024-05-05 14:56:36 -04:00
Ishaan Jaff
bbf5d79069
docs - set retry policy
2024-05-04 17:52:01 -07:00
Ishaan Jaff
d968dedd77
Merge pull request #1530 from TanaroSch/main
...
change max_tokens type to int
2024-05-04 12:47:15 -07:00
Rajan Paneru
7d9377f18a
Improve the document of Traceloop
...
If we follow exact step I was getting two errors, which this Pull-Request will fix:
* Fixed the yaml snippet spacing issue: it was throwing error when running copy+paste code
* Added api_key: my-fake-key as to run litellm --config config.yaml --debug it will be required, otherwise it will fail
Need for Improvements:
* The traceloop is ambigous, because it's a company that maintains OpenLLMetry, and has observability solution and the sdk name is aslo traceloop-sdk
* The doc was missing several other observability solutions
* The steps were not quite obvious, added one more steps to make things clear
2024-05-04 22:42:53 +09:30
Lunik
ebbeb333c6
✏️ doc: typo in azure content safety
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:45:15 +02:00
Lunik
cb178723ca
📝 doc: Azure content safety Proxy usage
...
Signed-off-by: Lunik <lunik@tiwabbit.fr>
2024-05-04 10:39:43 +02:00
Krrish Dholakia
f7eee60943
docs(exception_mapping.md): update exception mapping docs with 'should_retry'
2024-05-03 16:51:26 -07:00
Ishaan Jaff
bf048ecda4
docs - titan embeddings v2
2024-05-03 15:59:02 -07:00
Ishaan Jaff
209baaca02
docs - simplify prod docs
2024-05-03 15:40:05 -07:00
Krish Dholakia
2200900ca2
Merge pull request #3393 from Priva28/main
...
Add Llama3 tokenizer and allow custom tokenizers.
2024-05-02 16:32:41 -07:00
greenscale-nandesh
6009632705
Merge branch 'main' into main
2024-05-02 08:22:11 -07:00
Christian Privitelli
3449a5e446
update docs, allow use with encode/decode
2024-05-02 14:56:38 +10:00