litellm-mirror/litellm
Krrish Dholakia ca97ea8acd feat(proxy_server.py): team based model aliases
allow setting model aliases at a team level (e.g. route all 'gpt-3.5-turbo' requests from team-1 to model-deployment-group-2)
2024-03-06 17:42:08 -08:00
..
deprecated_litellm_server refactor: add black formatting 2023-12-25 14:11:20 +05:30
integrations (fix) high traffic langfuse, s3 2024-03-06 12:22:52 -08:00
llms fix(vertex_ai.py): correctly parse optional params and pass vertex ai project 2024-03-06 14:00:50 -08:00
proxy feat(proxy_server.py): team based model aliases 2024-03-06 17:42:08 -08:00
router_strategy fix(lowest_latency.py): consistent time calc 2024-02-14 15:03:35 -08:00
tests fix(vertex_ai.py): correctly parse optional params and pass vertex ai project 2024-03-06 14:00:50 -08:00
types (types) routerConfig 2024-01-02 14:14:29 +05:30
__init__.py Merge branch 'main' into litellm_claude_3_bedrock_access 2024-03-05 07:10:45 -08:00
_logging.py fix(proxy_server.py): update user cache to with new spend 2024-02-06 23:06:05 -08:00
_redis.py fix(redis.py): fix instantiating redis client from url 2024-02-15 17:48:00 -08:00
_version.py (fix) ci/cd don't let importing litellm._version block starting proxy 2024-02-01 16:23:16 -08:00
budget_manager.py feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given 2024-01-26 14:53:58 -08:00
caching.py fix(utils.py): only return cached streaming object for streaming calls 2024-02-21 21:27:40 -08:00
cost.json store llm costs in budget manager 2023-09-09 19:11:35 -07:00
exceptions.py fix(main.py): map list input to ollama prompt input format 2024-02-16 11:54:12 -08:00
main.py fix(vertex_ai.py): correctly parse optional params and pass vertex ai project 2024-03-06 14:00:50 -08:00
model_prices_and_context_window_backup.json fix(bedrock.py): add claude 3 support 2024-03-04 17:15:47 -08:00
requirements.txt Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
router.py Revert "(feat) track used api_base in response" 2024-03-02 11:12:09 -08:00
timeout.py refactor: add black formatting 2023-12-25 14:11:20 +05:30
utils.py fix(utils.py): handle dict object for chatcompletionmessagetoolcall 2024-03-05 18:10:58 -08:00