Merge pull request #48 from topoteretes/update_blog

Update docs
topoteretes · Mar 16, 2024 · dd4132c · dd4132c
2 parents f024e08 + 6db1066
commit dd4132c
Show file tree

Hide file tree

Showing 32 changed files with 943 additions and 45 deletions.
diff --git a/.github/workflows/py_lint.yaml b/.github/workflows/py_lint.yaml
@@ -1,32 +1,32 @@
-name: Python Linting
-
-on:
-  push:
-    branches: [ main ]  # This will trigger the workflow on pushes to the main branch
-  pull_request:  # This will trigger the workflow on any pull request to any branch
-
-jobs:
-  lint:
-    runs-on: ubuntu-latest
-    steps:
-    - uses: actions/checkout@v3
-    - name: Set up Python
-      uses: actions/setup-python@v4
-      with:
-        python-version: '3.11'  # Specify the Python version you want to use
-
-    - name: Install Poetry
-      run: |
-        curl -sSL https://install.python-poetry.org | python3 -  # Install Poetry
-
-    - name: Configure Poetry
-      run: |
-        poetry config virtualenvs.create false  # Configure poetry to not create a new virtual environment
-
-    - name: Install dependencies
-      run: |
-        poetry install  # Install the dependencies specified in pyproject.toml
-
-    - name: Run pylint
-      run: |
-        pylint $(git ls-files '*.py')  # Run pylint on all Python files in the repository
+#name: Python Linting
+#
+#on:
+#  push:
+#    branches: [ main ]  # This will trigger the workflow on pushes to the main branch
+#  pull_request:  # This will trigger the workflow on any pull request to any branch
+#
+#jobs:
+#  lint:
+#    runs-on: ubuntu-latest
+#    steps:
+#    - uses: actions/checkout@v3
+#    - name: Set up Python
+#      uses: actions/setup-python@v4
+#      with:
+#        python-version: '3.11'  # Specify the Python version you want to use
+#
+#    - name: Install Poetry
+#      run: |
+#        curl -sSL https://install.python-poetry.org | python3 -  # Install Poetry
+#
+#    - name: Configure Poetry
+#      run: |
+#        poetry config virtualenvs.create false  # Configure poetry to not create a new virtual environment
+#
+#    - name: Install dependencies
+#      run: |
+#        poetry install  # Install the dependencies specified in pyproject.toml
+#
+#    - name: Run pylint
+#      run: |
+#        pylint $(git ls-files '*.py')  # Run pylint on all Python files in the repository
diff --git a/README.md b/README.md
@@ -112,6 +112,17 @@ poetry add cognee
 Check out our demo notebook [here](cognee%20-%20Get%20Started.ipynb)
 
 
+- Set OpenAI API Key as an environment variable
+```
+import os
+
+# Setting an environment variable
+os.environ['OPENAI_API_KEY'] = ''
+
+
+```
+
+
 - Add a new piece of information to storage
 ```
 import cognee

diff --git a/cognee - Get Started.ipynb b/cognee - Get Started.ipynb
@@ -539,6 +539,8 @@
     }
    ],
    "source": [
+    "from cognee import search\n",
+    "from cognee.api.v1.search.search import SearchType\n",
     "query_params = {\n",
     "    SearchType.SIMILARITY: {'query': 'your search query here'}\n",
     "}\n",

diff --git a/docs/blog/index.md b/docs/blog/index.md
@@ -1,4 +1,4 @@
-# Welcome to the cognee Blog
+# Welcome to the cognee blog
 
 The goal of the blog is to discuss broader topics around the cognee project, including the motivation behind the project, the technical details, and the future of the project.
 

diff --git a/...ain + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/Untitled 1.png b/...ain + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/Untitled 1.png
diff --git a/...chain + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/Untitled.png b/...chain + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/Untitled.png
diff --git a/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(19).png b/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(19).png
diff --git a/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(20).png b/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(20).png
diff --git a/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(21).png b/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(21).png
diff --git a/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(22).png b/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(22).png
diff --git a/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(23).png b/...in + Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/carbon_(23).png
diff --git a/... Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/infographic_(2).png b/... Weaviate Level 2 towards 98ad7b915139478992c4c4386b5e5886/infographic_(2).png
diff --git a/...eaviate Level 3 towards e62946c272bf412584b12fbbf92d35b0/Dashboard_example.png b/...eaviate Level 3 towards e62946c272bf412584b12fbbf92d35b0/Dashboard_example.png
diff --git a/...Weaviate Level 4 towards fe90ff40e56e44c4a49f1492d360173c/How_cognee_works.png b/...Weaviate Level 4 towards fe90ff40e56e44c4a49f1492d360173c/How_cognee_works.png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 1.png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 1.png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 2.png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 2.png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 3.png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled 3.png
diff --git a/...chain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled.png b/...chain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/Untitled.png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(5).png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(5).png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(6).png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(6).png
diff --git a/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(9).png b/...ain + Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/carbon_(9).png
diff --git a/...eaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2) 1.png b/...eaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2) 1.png
diff --git a/...eaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2) 2.png b/...eaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2) 2.png
diff --git a/... Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2).png b/... Weaviate and towards a pr 7351d77a1eba40aab4394c24bef3a278/infographic_(2).png
diff --git a/docs/blog/posts/from-demo-to-production-1.md b/docs/blog/posts/from-demo-to-production-1.md
diff --git a/docs/blog/posts/from-demo-to-production-2.md b/docs/blog/posts/from-demo-to-production-2.md
@@ -1,5 +1,173 @@
 
+---
+draft: False
+date: 2023-10-05
+tags:
+  - pydantic
+  - langchain
+  - llm
+  - openai 
+  - functions
+  - pdfs
+authors:
+  - tricalt
+---
 
+# Going beyond Langchain + Weaviate: Level 2 towards Production
+
+### 1.1. The problem of putting code to production
+
+*This post is a part of a series of texts aiming to discover and understand patterns and practices that would enable building a production-ready AI data infrastructure. The main focus is on how to evolve data modeling and retrieval in order to enable Large Language Model (LLM) apps and Agents to serve millions of users concurrently.*
+
+*For a broad overview of the problem and our understanding of the current state of the LLM landscape, check out [our previous post](https://www.prometh.ai/promethai-memory-blog-post-one)*
+
+![infographic (2).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/infographic_(2).png)
+
+In this text, we continue our inquiry into what would constitute:
+
+1. Proper data engineering methods for LLMs
+2. A production-ready generative AI data platform that unlocks AI assistants/Agent Networks
+
+To explore these points, we here at [prometh.ai](http://prometh.ai/) have partnered with dlthub in order to productionize a common use case — complex PDF processing — progressing level by level.
+
+In the previous text, we wrote a simple script that relies on the Weaviate Vector database to turn unstructured data into structured data and help us make sense of it.
+
+In this post, some of the shortcomings from the previous level will be addressed, including::
+
+1. Containerization
+2. Data model
+3. Data contract
+4. Vector Database retrieval strategies
+5. LLM context and task generation
+6. Dynamic Agent behavior and Agent tooling
+
+## 3. Level 2:  Memory Layer + FastAPI + Langchain + Weaviate
+
+### 3.1. Developer Intent at Level 2
+
+This phase enhances the basic script by incorporating:
+
+- Memory Manager
+
+    The memory manager facilitates the execution and processing of VectorDB data by:
+
+    1. Uniformly applying CRUD (Create, Read, Update, Delete) operations across various classes
+    2. Representing different business domains or concepts, and
+    3. Ensuring they adhere to a common data model, which regulates all data points across the system.
+- Context Manager
+
+    This central component processes and analyzes data from Vector DB, evaluates its significance, and compares the results with user-defined benchmarks.
+
+    The primary objective is to establish a mechanism that encourages in-context learning and empowers the Agent’s adaptive understanding.
+
+    As an example, let’s assume we uploaded the book *A Call of the Wild* by Jack London to our Vector DB semantic layer, to give our LLM a better understanding of the life of sled dogs in the early 1900s.
+
+    Asking a question about the contents of the book will yield a straightforward answer, provided that the book contains an explicit answer to our question.
+
+    To enable better question answering and access to additional information such as historical context, summaries, and other documents, we need to introduce different memory stores and a set of **attention modulators**, which are meant to manage the prioritization of data retrieved for the answers.
+
+- Task Manager
+
+    Utilizing the tools at hand and guided by the user's prompt, the task manager determines a sequence of actions and their execution order.
+
+    For example, let’s assume that the user asks: “When was Buck (one of the dogs from *A Call of the Wild*) kidnapped” and to have the answer translated to German”
+
+    This query would be broken down by the task manager into a set of atomic tasks that can then be handed over to the Agent.
+
+    The ordered task list could be:
+
+    1. Retrieve information about the PDF from the database.
+    2. Translate the information to German.
+- The Agent
+
+    AI agents can use computers independently. They can browse the web, use apps, read and write files, make credit card payments, and even autonomously execute processes on your personal computer.
+
+    In our case, the Agent has only a few tools at its disposal, such as tools to translate text or structure data. Using these tools, it processes and executes tasks in the sequence they are provided by the Task Manager and the Context Manager.
+
+
+### 3.2 **Toward the memory layer** - POC at level 2
+
+
+
+![Untitled](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/Untitled.png)
+
+At this stage, our proof of concept (POC) allows uploading a PDF document and requesting specific actions on it such as "load to database", "translate to German", or "convert to JSON." Prior task resolutions and potential operations are assessed by the Context Manager and Task Manager services.
+
+The following set of steps explains the workflow of the POC at level 2:
+
+- Initially, we specify the parameters for the document we wish to upload and define our objective in the prompt:
+
+![Untitled](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/Untitled%201.png)
+
+- The memory manager retrieves the parameters and the attention modulators and creates context based on Episodic and Semantic memory stores (previous runs of the job + raw data):
+
+    ![carbon (23).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/carbon_(23).png)
+
+
+- To do this, it starts by filtering user input, in the same way our brains filter important from redundant information. As an example, if there are children playing and talking loudly in the background during our Zoom meeting, we can still pool our attention together and focus on what the person on the other side is saying.
+
+    The same principle is applied here:
+
+
+![carbon (19).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/carbon_(19).png)
+
+- In the next step, we apply a set of attention modulators to process the data obtained from the Vector Store.
+
+    *NOTE: In cognitive science, attention modulators can be thought of as factors or     mechanisms that influence the direction and intensity of attention.*
+
+    *As we have many memory stores, we need to prioritize the data points that we retrieve via semantic search.*
+
+    *Since semantic search is not enough by itself, scoring data points happens via a set of functions that replicate how attention modulators work in cognitive science.*
+
+    Initially, we’ve implemented a few attention modulators that we thought could improve the document retrieval process:
+
+    **Frequency**: This refers to how often a specific stimulus or event is encountered. Stimuli that are encountered more frequently are more likely to be attended to or remembered.
+
+    **Recency**: This refers to how recently a stimulus or event was encountered. Items or events that occurred more recently are typically easier to recall than those that occurred a long time ago.
+
+
+We have implemented many more, and you can find them in our
+
+[repository](https://github.com/topoteretes/PromethAI-Memory). More are still needed and contributions are more than welcome.
+
+Let’s see the modulators in action:
+
+![carbon (20).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/carbon_(20).png)
+
+In the code above we fetch the memories from the Semantic Memory bank where our knowledge of the world is stored (the PDFs). We select the relevant documents by using the handle_modulator function.
+
+- The handle_modulator function is defined below and explains how scoring of memories happens.
+
+![carbon (21).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/carbon_(21).png)
+
+We process the data retrieved with OpenAI functions and store the results for the Task Manager to be able to determine what actions the Agent should take.
+
+The Task Manager then sorts and converts user input into a set of actionable steps based on the tools available.
+
+![carbon (22).png](Going%20beyond%20Langchain%20+%20Weaviate%20Level%202%20towards%20%2098ad7b915139478992c4c4386b5e5886/carbon_(22).png)
+
+Finally, the Agent interprets the context and performs the steps using the tools it has available. We see this as the step where the Agents take over the task, executing it in their own way.
+
+Now, let's look back at what constitutes the Data Platform:
+
+| Memory type  | State | Description |
+| --- | --- | --- |
+| Sensory Memory | API | Can be interpreted in this context as the interface used for the human input  |
+| STM | Weaviate Class with hardcoded contract | The processing layer and a storage of the session/user context |
+| LTM | Weaviate Class with hardcoded contract | The information storage |
+
+Lacks:
+
+- Extendability: Capability to add features or functionality.
+- Loading flexibility: Ability to apply different chunking strategies
+- Testability: How to test the code and make sure it runs
+
+**Next Steps**:
+
+1. Implement different strategies for vector search
+2. Add more tools to process PDFs
+3. Add more attention modulators
+4. Add a solid test framework