v0.2.0
Features
- Vision/Multimodality: Neuro will now take screenshots and send it to a multimodal llm for processing! Great for reaction content or live commentary.
Changes
- Refactored llmWrappers. TextLLMWrapper and ImageLLMWrapper are subclasses of AbstractLLMWrapper. This should allow for easier extensions onto other LLMs with differing API formats.
- Updated the RealtimeTTS and RealtimeSTT libraries, which solves some dependency headaches. If you have dependency troubles upgrading from an existing installation, trying removing all packages and reinstalling following the installation instructions.
Note: Release versions do indicate that it has been end-to-end tested, but I still recommend cloning main for bug fixes and tweaks that make the AI behave better.