ishaan-jaff
|
efd5dbccd5
|
(feat) completion:openai pass OpenAI client
|
2023-11-28 16:05:01 -08:00 |
|
ishaan-jaff
|
a2d7623d6e
|
(feat) completion: Azure allow users to pass client to router
|
2023-11-28 15:56:52 -08:00 |
|
ishaan-jaff
|
ad6a3fb8fe
|
(feat) allow users to pass azure client for acompletion
|
2023-11-28 15:44:56 -08:00 |
|
Krrish Dholakia
|
6a7a8be4fb
|
fix(utils.py): fix streaming on-success logging
|
2023-11-28 09:11:47 -08:00 |
|
ishaan-jaff
|
49afff8c88
|
(fix) completion: AZURE_OPENAI_API_KEY
|
2023-11-28 08:06:06 -08:00 |
|
Krrish Dholakia
|
91e8ff439d
|
fix(main.py): fix linting errors
|
2023-11-27 19:11:38 -08:00 |
|
Krrish Dholakia
|
01a71cd0dc
|
fix(stream_chunk_builder): adding support for tool calling in completion counting
|
2023-11-27 18:39:47 -08:00 |
|
Krrish Dholakia
|
e564f643f5
|
fix(router.py): speed improvements to the router
|
2023-11-27 17:35:26 -08:00 |
|
Krrish Dholakia
|
fb680ce4a2
|
fix(acompletion): fix acompletion raise exception issue when custom llm provider is none
|
2023-11-27 11:34:48 -08:00 |
|
ishaan-jaff
|
815e1f385b
|
(fix) embedding: filter out metadata from optional_params
|
2023-11-25 11:10:06 -08:00 |
|
Krrish Dholakia
|
ff6975a98d
|
fix(main.py): logit bias mapping for batch_completions
|
2023-11-24 16:05:51 -08:00 |
|
Krrish Dholakia
|
85854e00c9
|
fix(main.py): fixing linting issues
|
2023-11-24 15:25:51 -08:00 |
|
ishaan-jaff
|
95b7ba4634
|
(fix) completion: when logit bias is None
|
2023-11-24 14:01:21 -08:00 |
|
Krrish Dholakia
|
6f40fd8ee2
|
fix(proxy_server.py): fix linting issues
|
2023-11-24 11:39:01 -08:00 |
|
Krrish Dholakia
|
5a9a3aa89c
|
fix(main.py): fix streaming_chunk_builder to return usage
|
2023-11-24 11:27:04 -08:00 |
|
Krrish Dholakia
|
27f5703ca2
|
fix(main.py): only set api key to dynamic api key if it's not none
|
2023-11-23 16:45:44 -08:00 |
|
Krrish Dholakia
|
6999516f5b
|
fix: fix linting issues
|
2023-11-23 13:47:43 -08:00 |
|
Krrish Dholakia
|
e4f40f4535
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
ishaan-jaff
|
f3beb984ce
|
(feat) embedding() remove junk params
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
e495a8a9c2
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
Krrish Dholakia
|
9bb2c7ee0f
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
|
2023-11-22 10:04:27 -08:00 |
|
ishaan-jaff
|
20ecef8664
|
(fix) request_timeout mapped to timeout for openai
|
2023-11-21 13:22:21 -08:00 |
|
Krrish Dholakia
|
8e1dcc540f
|
fix(main.py): revert model alias map change
|
2023-11-20 21:07:52 -08:00 |
|
Krrish Dholakia
|
d83c2b9ee8
|
fix(main.py): fix model alias map logic
|
2023-11-20 20:49:10 -08:00 |
|
Krrish Dholakia
|
c7e2cbd995
|
fix(utils.py): adding support for rules + mythomax/alpaca prompt template
|
2023-11-20 18:58:15 -08:00 |
|
ishaan-jaff
|
7bc28f3b1c
|
(fix) pydantic errors with response.time
|
2023-11-20 18:28:19 -08:00 |
|
ishaan-jaff
|
756f356897
|
(fix) completion: max_retries using OpenAI client
|
2023-11-20 16:57:37 -08:00 |
|
ishaan-jaff
|
903016ba2f
|
(fix) stream chunk builder when content == None in chunk
|
2023-11-18 17:16:05 -08:00 |
|
Krrish Dholakia
|
34decba3cf
|
fix(text_completion): allow either model or engine to be set
|
2023-11-17 18:25:21 -08:00 |
|
Krrish Dholakia
|
7d70bf84a7
|
test(test_langfuse.py): handle timeouts
|
2023-11-17 17:05:46 -08:00 |
|
Krrish Dholakia
|
aa6e6865cd
|
fix(main.py): fix linting issue
|
2023-11-17 15:45:00 -08:00 |
|
Krrish Dholakia
|
02ed97d0b2
|
fix(acompletion): support client side timeouts + raise exceptions correctly for async calls
|
2023-11-17 15:39:47 -08:00 |
|
ishaan-jaff
|
ef8d82a54c
|
(feat) completion: add response_format, seed, tools, tool_choice
|
2023-11-17 13:59:57 -08:00 |
|
Ishaan Jaff
|
c4f51594d5
|
(ci/cd) run again
|
2023-11-16 21:29:18 -08:00 |
|
Krrish Dholakia
|
a6e9f147d3
|
fix(openai.py): handling extra headers
|
2023-11-16 12:48:21 -08:00 |
|
ishaan-jaff
|
7ba8271250
|
(linting) fix
|
2023-11-16 12:33:03 -08:00 |
|
ishaan-jaff
|
6645c510ac
|
(fix) linting
|
2023-11-16 12:25:46 -08:00 |
|
Krrish Dholakia
|
9c2e74b6ea
|
fix(azure.py): fix linting errors
|
2023-11-16 12:15:50 -08:00 |
|
ishaan-jaff
|
3b75dfe490
|
(fix) linting
|
2023-11-16 11:44:26 -08:00 |
|
ishaan-jaff
|
985f9b1ec1
|
(feat) text_completion add rules on when to use engine & model together
|
2023-11-16 11:37:31 -08:00 |
|
Rodney Rodriguez
|
f2d8bfd40d
|
bugfixes for ollama
|
2023-11-15 19:27:06 -06:00 |
|
Krrish Dholakia
|
6d1b8945dd
|
fix(huggingface_restapi.py): fix linting errors
|
2023-11-15 15:34:21 -08:00 |
|
Krrish Dholakia
|
e35ce15a89
|
refactor(huggingface_restapi.py): moving async completion + streaming to real async calls
|
2023-11-15 15:14:21 -08:00 |
|
Krrish Dholakia
|
0f6713993d
|
fix(router.py): enabling retrying with expo backoff (without tenacity) for router
|
2023-11-14 20:57:51 -08:00 |
|
Krrish Dholakia
|
5d58bb9cd0
|
fix(main.py): misrouting ollama models to nlp cloud
|
2023-11-14 18:55:08 -08:00 |
|
ishaan-jaff
|
d25d4d26bd
|
(feat) debug ollama POST request
|
2023-11-14 17:53:48 -08:00 |
|
Krrish Dholakia
|
9b582b2c85
|
fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors
|
2023-11-14 16:26:05 -08:00 |
|
ishaan-jaff
|
c5e81bfb19
|
(fix) linting fix
|
2023-11-14 14:40:59 -08:00 |
|
ishaan-jaff
|
7a3fa18a64
|
(feat) pass azure_ad_token to embedding
|
2023-11-14 12:48:31 -08:00 |
|
ishaan-jaff
|
c903fcc6f3
|
(feat) pass azure_ad_token to completion
|
2023-11-14 12:48:31 -08:00 |
|