mirror of https://github.com/BerriAI/litellm.git synced 2025-04-26 03:04:13 +00:00

History

Krish Dholakia c3edfc2c92 All checks were successful Read Version from pyproject.toml / read-version (push) Successful in 35s Details LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394 ) * build(model_prices_and_context_window.json): add gemini-1.5-flash context caching * fix(context_caching/transformation.py): just use last identified cache point Fixes https://github.com/BerriAI/litellm/issues/6738 * fix(context_caching/transformation.py): pick first contiguous block - handles system message error from google Fixes https://github.com/BerriAI/litellm/issues/6738 * fix(vertex_ai/gemini/): track context caching tokens * refactor(gemini/): place transformation.py inside `chat/` folder make it easy for user to know we support the equivalent endpoint * fix: fix import * refactor(vertex_ai/): move vertex_ai cost calc inside vertex_ai/ folder make it easier to see cost calculation logic * fix: fix linting errors * fix: fix circular import * feat(gemini/cost_calculator.py): support gemini context caching cost calculation generifies anthropic's cost calculation function and uses it across anthropic + gemini * build(model_prices_and_context_window.json): add cost tracking for gemini-1.5-flash-002 w/ context caching Closes https://github.com/BerriAI/litellm/issues/6891 * docs(gemini.md): add gemini context caching architecture diagram make it easier for user to understand how context caching works * docs(gemini.md): link to relevant gemini context caching code * docs(gemini/context_caching): add readme in github, make it easy for dev to know context caching is supported + where to go for code * fix(llm_cost_calc/utils.py): handle gemini 128k token diff cost calc scenario * fix(deepseek/cost_calculator.py): support deepseek context caching cost calculation * test: fix test		2024-12-23 22:02:52 -08:00
..
docs	LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394 )	2024-12-23 22:02:52 -08:00
img	LiteLLM Minor Fixes & Improvements (12/23/2024) - p3 (#7394 )	2024-12-23 22:02:52 -08:00
release_notes	update release notes	2024-12-23 21:48:33 -08:00
src	bye (#6982 )	2024-12-05 13:38:10 -08:00
static	v1	2023-08-17 15:31:20 -07:00
.gitignore	Add docs to export logs to Laminar (#6674 )	2024-11-11 12:15:47 -08:00
babel.config.js	updating docs	2023-08-12 11:30:32 -07:00
Dockerfile	(docs) new dockerfile for litellm proxy	2023-11-17 17:39:07 -08:00
docusaurus.config.js	Litellm docs update (#7365 )	2024-12-21 21:09:50 -08:00
index.md	fix keys	2023-08-17 16:13:52 -07:00
package-lock.json	fix(main.py): fix retries being multiplied when using openai sdk (#7221 )	2024-12-14 11:56:55 -08:00
package.json	bye (#6982 )	2024-12-05 13:38:10 -08:00
README.md	updating docs	2023-08-12 11:30:32 -07:00
sidebars.js	docs add files to supported endpoints	2024-12-23 20:51:34 -08:00

README.md

Website

This website is built using Docusaurus 2, a modern static website generator.

Installation

$ yarn

Local Development

$ yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

$ yarn build

This command generates static content into the build directory and can be served using any static contents hosting service.

Deployment

Using SSH:

$ USE_SSH=true yarn deploy

Not using SSH:

$ GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.