Krrish Dholakia
|
ff6975a98d
|
fix(main.py): logit bias mapping for batch_completions
|
2023-11-24 16:05:51 -08:00 |
|
Krrish Dholakia
|
85854e00c9
|
fix(main.py): fixing linting issues
|
2023-11-24 15:25:51 -08:00 |
|
ishaan-jaff
|
95b7ba4634
|
(fix) completion: when logit bias is None
|
2023-11-24 14:01:21 -08:00 |
|
Krrish Dholakia
|
6f40fd8ee2
|
fix(proxy_server.py): fix linting issues
|
2023-11-24 11:39:01 -08:00 |
|
Krrish Dholakia
|
5a9a3aa89c
|
fix(main.py): fix streaming_chunk_builder to return usage
|
2023-11-24 11:27:04 -08:00 |
|
Krrish Dholakia
|
27f5703ca2
|
fix(main.py): only set api key to dynamic api key if it's not none
|
2023-11-23 16:45:44 -08:00 |
|
Krrish Dholakia
|
6999516f5b
|
fix: fix linting issues
|
2023-11-23 13:47:43 -08:00 |
|
Krrish Dholakia
|
e4f40f4535
|
fix(utils.py): support reading api keys dynamically from the os environment
|
2023-11-23 13:41:56 -08:00 |
|
ishaan-jaff
|
f3beb984ce
|
(feat) embedding() remove junk params
|
2023-11-22 14:25:55 -08:00 |
|
Krrish Dholakia
|
e495a8a9c2
|
fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints
|
2023-11-22 13:22:58 -08:00 |
|
Krrish Dholakia
|
9bb2c7ee0f
|
fix(utils.py): add param mapping for perplexity, anyscale, deepinfra
n
n
|
2023-11-22 10:04:27 -08:00 |
|
ishaan-jaff
|
20ecef8664
|
(fix) request_timeout mapped to timeout for openai
|
2023-11-21 13:22:21 -08:00 |
|
Krrish Dholakia
|
8e1dcc540f
|
fix(main.py): revert model alias map change
|
2023-11-20 21:07:52 -08:00 |
|
Krrish Dholakia
|
d83c2b9ee8
|
fix(main.py): fix model alias map logic
|
2023-11-20 20:49:10 -08:00 |
|
Krrish Dholakia
|
c7e2cbd995
|
fix(utils.py): adding support for rules + mythomax/alpaca prompt template
|
2023-11-20 18:58:15 -08:00 |
|
ishaan-jaff
|
7bc28f3b1c
|
(fix) pydantic errors with response.time
|
2023-11-20 18:28:19 -08:00 |
|
ishaan-jaff
|
756f356897
|
(fix) completion: max_retries using OpenAI client
|
2023-11-20 16:57:37 -08:00 |
|
ishaan-jaff
|
903016ba2f
|
(fix) stream chunk builder when content == None in chunk
|
2023-11-18 17:16:05 -08:00 |
|
Krrish Dholakia
|
34decba3cf
|
fix(text_completion): allow either model or engine to be set
|
2023-11-17 18:25:21 -08:00 |
|
Krrish Dholakia
|
7d70bf84a7
|
test(test_langfuse.py): handle timeouts
|
2023-11-17 17:05:46 -08:00 |
|
Krrish Dholakia
|
aa6e6865cd
|
fix(main.py): fix linting issue
|
2023-11-17 15:45:00 -08:00 |
|
Krrish Dholakia
|
02ed97d0b2
|
fix(acompletion): support client side timeouts + raise exceptions correctly for async calls
|
2023-11-17 15:39:47 -08:00 |
|
ishaan-jaff
|
ef8d82a54c
|
(feat) completion: add response_format, seed, tools, tool_choice
|
2023-11-17 13:59:57 -08:00 |
|
Ishaan Jaff
|
c4f51594d5
|
(ci/cd) run again
|
2023-11-16 21:29:18 -08:00 |
|
Krrish Dholakia
|
a6e9f147d3
|
fix(openai.py): handling extra headers
|
2023-11-16 12:48:21 -08:00 |
|
ishaan-jaff
|
7ba8271250
|
(linting) fix
|
2023-11-16 12:33:03 -08:00 |
|
ishaan-jaff
|
6645c510ac
|
(fix) linting
|
2023-11-16 12:25:46 -08:00 |
|
Krrish Dholakia
|
9c2e74b6ea
|
fix(azure.py): fix linting errors
|
2023-11-16 12:15:50 -08:00 |
|
ishaan-jaff
|
3b75dfe490
|
(fix) linting
|
2023-11-16 11:44:26 -08:00 |
|
ishaan-jaff
|
985f9b1ec1
|
(feat) text_completion add rules on when to use engine & model together
|
2023-11-16 11:37:31 -08:00 |
|
Rodney Rodriguez
|
f2d8bfd40d
|
bugfixes for ollama
|
2023-11-15 19:27:06 -06:00 |
|
Krrish Dholakia
|
6d1b8945dd
|
fix(huggingface_restapi.py): fix linting errors
|
2023-11-15 15:34:21 -08:00 |
|
Krrish Dholakia
|
e35ce15a89
|
refactor(huggingface_restapi.py): moving async completion + streaming to real async calls
|
2023-11-15 15:14:21 -08:00 |
|
Krrish Dholakia
|
0f6713993d
|
fix(router.py): enabling retrying with expo backoff (without tenacity) for router
|
2023-11-14 20:57:51 -08:00 |
|
Krrish Dholakia
|
5d58bb9cd0
|
fix(main.py): misrouting ollama models to nlp cloud
|
2023-11-14 18:55:08 -08:00 |
|
ishaan-jaff
|
d25d4d26bd
|
(feat) debug ollama POST request
|
2023-11-14 17:53:48 -08:00 |
|
Krrish Dholakia
|
9b582b2c85
|
fix(main.py): keep client consistent across calls + exponential backoff retry on ratelimit errors
|
2023-11-14 16:26:05 -08:00 |
|
ishaan-jaff
|
c5e81bfb19
|
(fix) linting fix
|
2023-11-14 14:40:59 -08:00 |
|
ishaan-jaff
|
7a3fa18a64
|
(feat) pass azure_ad_token to embedding
|
2023-11-14 12:48:31 -08:00 |
|
ishaan-jaff
|
c903fcc6f3
|
(feat) pass azure_ad_token to completion
|
2023-11-14 12:48:31 -08:00 |
|
ishaan-jaff
|
cd4ed86a77
|
(feat) completion:azure - add azure_ad_token
|
2023-11-14 12:12:14 -08:00 |
|
Krrish Dholakia
|
8ff9d8fabe
|
fix(main.py): fix linting errors
|
2023-11-13 15:31:46 -08:00 |
|
Krrish Dholakia
|
6ca8528c25
|
fix(main.py): fix linting errors
|
2023-11-13 14:52:37 -08:00 |
|
Krrish Dholakia
|
330708e7ef
|
fix(tests): fixing response objects for testing
|
2023-11-13 14:39:30 -08:00 |
|
Krrish Dholakia
|
bdf801d987
|
fix(together_ai.py): exception mapping for tgai
|
2023-11-13 13:17:15 -08:00 |
|
ishaan-jaff
|
f388000566
|
(fix) deepinfra with openai v1.0.0
|
2023-11-13 09:51:22 -08:00 |
|
Krrish Dholakia
|
d9f54f04e3
|
build(main.py): trigger testing
|
2023-11-11 19:20:48 -08:00 |
|
Krrish Dholakia
|
4b74ddcb17
|
refactor: fixing linting issues
|
2023-11-11 18:52:28 -08:00 |
|
Krrish Dholakia
|
e7d1840d5f
|
fix(main.py): fix caching for router
|
2023-11-11 17:45:23 -08:00 |
|
Krrish Dholakia
|
c0a757a25f
|
refactor(azure.py): working azure completion calls with openai v1 sdk
|
2023-11-11 16:44:39 -08:00 |
|