litellm-mirror/litellm
Ishaan Jaff 4777921a31
Merge pull request #2723 from BerriAI/litellm_proxy_perf_imp
[FEAT] Improve Proxy Perf - access router model names in constant time
2024-03-27 20:48:31 -07:00
..
deprecated_litellm_server refactor: add black formatting 2023-12-25 14:11:20 +05:30
integrations feat(llm_guard.py): enable key-specific llm guard check 2024-03-26 17:21:51 -07:00
llms Merge pull request #2701 from rmann-nflx/main 2024-03-27 10:14:20 -07:00
proxy Merge pull request #2723 from BerriAI/litellm_proxy_perf_imp 2024-03-27 20:48:31 -07:00
router_strategy (fix) stop using f strings with logger 2024-03-25 10:47:18 -07:00
tests test(test_completion.py): skip unresponsive endpoint 2024-03-27 20:12:22 -07:00
types (types) routerConfig 2024-01-02 14:14:29 +05:30
__init__.py fix(llm_guard.py): working llm-guard 'key-specific' mode 2024-03-26 17:47:20 -07:00
_logging.py fix(parallel_request_limiter.py): handle metadata being none 2024-03-14 10:02:41 -07:00
_redis.py fix(redis.py): fix instantiating redis client from url 2024-02-15 17:48:00 -08:00
_version.py (fix) ci/cd don't let importing litellm._version block starting proxy 2024-02-01 16:23:16 -08:00
budget_manager.py feat(utils.py): support region based pricing for bedrock + use bedrock's token counts if given 2024-01-26 14:53:58 -08:00
caching.py (fix) undo changes from other branches 2024-03-26 09:22:19 -07:00
cost.json store llm costs in budget manager 2023-09-09 19:11:35 -07:00
exceptions.py fix(main.py): map list input to ollama prompt input format 2024-02-16 11:54:12 -08:00
main.py refactor(main.py): trigger new build 2024-03-26 21:18:51 -07:00
model_prices_and_context_window_backup.json feat(router.py): enable pre-call checks 2024-03-23 18:03:30 -07:00
requirements.txt Add symlink and only copy in source dir to stay under 50MB compressed limit for Lambdas. 2023-11-22 23:07:33 -05:00
router.py fix(router.py): check for context window error when handling 400 status code errors 2024-03-26 08:08:15 -07:00
timeout.py refactor: add black formatting 2023-12-25 14:11:20 +05:30
utils.py fix(proxy/utils.py): check cache before alerting user 2024-03-27 20:09:15 -07:00