POC application to implement Q&A on a website utilizing the Langchain framework.
- Clone the repository.
- Install python 3.12 virtual environment and activate it.
- Install Poetry:
pip install poetry
- Install dependencies
poetry install
- Create a
.env
file in the root directory using the provided.env.example
file as a template.
poetry run crawler --max-seen-urls=3 https://gradio.app/
INFO - Written 3 urls to gradio.app-urls.txt
poetry run crawler --help
poetry run process_urls gradio.app-urls.txt
...
INFO - Saved index to html_vector_index
...
INFO - Saved index to text_vector_index
...
INFO - Report saved to report.csv
poetry run process_urls --help
poetry run app
App will be available at http://localhost:7860