Getting Started
Running the HumanEval generation example
Quick Start
Prerequisites
Python 3.10, 3.11 or 3.12
pip
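As a quick sanity check before installing, the following minimal sketch tests whether the running interpreter is one of the versions listed above. The helper name `is_supported_python` is hypothetical and is not part of CELI.

```python
import sys

# (major, minor) pairs taken from the Prerequisites list above.
SUPPORTED_VERSIONS = {(3, 10), (3, 11), (3, 12)}


def is_supported_python(version_info=sys.version_info):
    """Return True if the (major, minor) pair is one of the listed versions."""
    return tuple(version_info[:2]) in SUPPORTED_VERSIONS


if not is_supported_python():
    print(
        f"Python {sys.version_info.major}.{sys.version_info.minor} "
        "is not one of the listed versions (3.10, 3.11, 3.12)."
    )
```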
Installation
pip install celi-framework
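To confirm that the install succeeded, one option is to check that the `celi_framework` module (the name used in the `python -m celi_framework.main` command) can be found by the interpreter. This is a sketch using only the standard library; `is_installed` is a hypothetical helper name.

```python
from importlib.util import find_spec


def is_installed(module_name="celi_framework"):
    """Return True if a top-level module with this name can be located."""
    return find_spec(module_name) is not None


if is_installed():
    print("celi_framework is importable")
else:
    print("celi_framework was not found; try re-running the pip install step")
```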
Run a Simple Demo (no LLM setup required)
python -m celi_framework.main \
--job-description=celi_framework.examples.human_eval.job_description.job_description \
--tool-config='{"single_example":"HumanEval/3"}' \
--simulate-live
This example simulates using CELI to solve problem #3 of the HumanEval benchmark programming problem set. It uses
cached versions of the LLM outputs so it doesn’t require an API key or make any paid LLM calls on your behalf. The
result is written to the target/drafts directory.
Running this demo should take a few minutes. You will be able to see how CELI tackles the problem and the LLM calls it makes, along with the responses.
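The same demo can also be launched from Python, which avoids shell quoting problems with the JSON value passed to `--tool-config`. The sketch below builds the argument list from the flags shown in the command above; `build_demo_command` and `run_demo` are hypothetical helper names, and it is an assumption that cached outputs exist only for the example shown (`HumanEval/3`).

```python
import json
import subprocess
import sys

# Job description module path, copied from the command above.
JOB_DESCRIPTION = "celi_framework.examples.human_eval.job_description.job_description"


def build_demo_command(example_id="HumanEval/3"):
    """Build the simulated-run command as an argument list (no shell quoting needed)."""
    # Compact separators reproduce the JSON string used in the shell command.
    tool_config = json.dumps({"single_example": example_id}, separators=(",", ":"))
    return [
        sys.executable, "-m", "celi_framework.main",
        f"--job-description={JOB_DESCRIPTION}",
        f"--tool-config={tool_config}",
        "--simulate-live",
    ]


def run_demo(example_id="HumanEval/3"):
    """Run the demo and raise CalledProcessError if it exits with an error."""
    return subprocess.run(build_demo_command(example_id), check=True)
```

Calling `run_demo()` starts the simulated run; `build_demo_command()` on its own only returns the argument list, which is handy for inspecting what would be executed.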
Explore the Results
Check the output in the target/drafts directory.
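A small sketch for listing what a run produced, newest file first. It assumes the command was run from the current working directory, so that `target/drafts` is a relative path; the names and formats of the files CELI writes there are not specified here, so the helper (hypothetical name `list_drafts`) simply lists every file it finds.

```python
from pathlib import Path


def list_drafts(directory="target/drafts"):
    """Return all files under the drafts directory, most recently modified first."""
    root = Path(directory)
    if not root.is_dir():
        # Nothing has been generated yet, or the path is wrong.
        return []
    files = [p for p in root.rglob("*") if p.is_file()]
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


for path in list_drafts():
    print(path)
```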
Run the Example Live
The code above uses a cached version of the LLM results. To meaningfully run CELI on anything new, you will need to make new LLM calls, which will require an OpenAI API key (or your own local LLM; see LLM Support).
python -m celi_framework.main \
--job-description=celi_framework.examples.human_eval.job_description.job_description \
--openai-api-key=<Insert your OpenAI API key here>
Again, you can check your output in the target/drafts directory.
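To avoid typing the API key directly into the live command (where it would end up in shell history), one option is to read it from an environment variable and build the argument list in Python. This is a sketch only: the variable name `OPENAI_API_KEY` and the helper `build_live_command` are assumptions made for illustration, and only the flags shown in the live command above are included.

```python
import os
import sys

# Job description module path, copied from the live command above.
JOB_DESCRIPTION = "celi_framework.examples.human_eval.job_description.job_description"


def build_live_command(env=os.environ, key_var="OPENAI_API_KEY"):
    """Build the live-run command, taking the API key from an environment mapping.

    key_var is an assumed variable name chosen for this sketch.
    """
    api_key = env.get(key_var)
    if not api_key:
        raise RuntimeError(f"Set {key_var} before running the live example")
    return [
        sys.executable, "-m", "celi_framework.main",
        f"--job-description={JOB_DESCRIPTION}",
        f"--openai-api-key={api_key}",
    ]
```

The resulting list can be passed to `subprocess.run(..., check=True)`. Note that the key is still visible in the process arguments while the command runs; this approach only keeps it out of the command you type.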