Visual programming framework that utilizes natural language instructions and integrates with OpenAI's API for seamless visual program creation, debugging, and evaluation.
-
Install Anaconda (If you haven't installed it yet)
Follow the instructions to install Anaconda from here. -
Create a new folder to use VISPROG
-
Open terminal and follow the steps:
-
Git clone the main branch of the VISPROG GitHub repository
git clone https://github.com/hyunhp/VISPROG.git
-
Change directory to the cloned Git folder
cd visprog -
Install dependencies
conda env create -f environment.yaml
-
Activate the conda virtual environment
conda activate visprog
-
-
Create a
.envfile inside thevisprogfolder, and insert your OPENAI_API_KEY information without quotes:OPENAI_API_KEY=your_api_key -
Debug and update the original VISPROG repo:
- Update the OpenAI model calling part ("text-davinci-003" has been deprecated as of January 4th, 2024).
- Update the code
vis_masksinvis_utils.py. - Update CUDA usage inside
step_interpreters.py. - Update NLVR prompts to debug answers in digit format (e.g., 1, 2, 3) instead of text format (e.g., one, two, three) in the eval interpreters function.
-
Handle the 410 error encountered when using the
FaceDetInterpreterclass:- Download the DSFD_RES152.pth model and save it into your local torch hub folder directory (e.g.,
C:/{USER_NAME}/.cache/torch/hub/checkpoints).
- Download the DSFD_RES152.pth model and save it into your local torch hub folder directory (e.g.,
UPDATED AT: 7th July, 2024