Screen Capture OCR Chrome Extension

A Chrome extension that captures the current tab, extracts its text with OCR (Tesseract.js), and sends that text to an LLM through the Hugging Face Inference API for analysis.

Features

Screen capture with adjustable quality settings
OCR text extraction using Tesseract.js
LLM-powered analysis using Hugging Face Inference API
Modern, responsive UI with progress indicators
Efficient image processing and caching
Error handling and file type validation
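
As a rough illustration of how the OCR, LLM, and caching features could fit together on the backend, here is a minimal sketch. It is not the project's actual code: `createAnalyzer`, `recognize`, `generate`, and the prompt wording are hypothetical names chosen for this example, and the OCR and LLM calls are injected so the sketch does not depend on a specific client version.

```javascript
const crypto = require("crypto");

// Hypothetical pipeline: OCR the capture, then ask an LLM about the text.
// `recognize` and `generate` are injected functions (assumptions), so this
// sketch makes no claim about how the project wires Tesseract.js or
// the Hugging Face Inference API.
function createAnalyzer({ recognize, generate, cache = new Map() }) {
  return async function analyze(imageBuffer, instructions = "") {
    // Key the cache on the image content so identical captures skip OCR.
    const key = crypto.createHash("sha256").update(imageBuffer).digest("hex");

    let text = cache.get(key);
    if (text === undefined) {
      text = (await recognize(imageBuffer)).trim();
      cache.set(key, text);
    }

    if (text.length === 0) {
      throw new Error("OCR found no text in the capture");
    }

    // Fall back to a generic prompt when the user gave no instructions.
    const prompt = instructions
      ? `${instructions}\n\nScreen text:\n${text}`
      : `Summarize the following screen text:\n${text}`;

    return { text, insights: await generate(prompt) };
  };
}
```

With Tesseract.js, `recognize` could plausibly be a thin wrapper such as `async (img) => (await Tesseract.recognize(img, "eng")).data.text`. Only the OCR result is cached here, so changing the instructions still triggers a fresh LLM call.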

Installation

Clone the repository:

git clone https://github.com/KPrathamesh-27/Guide_Extension

Install dependencies:

# Install backend dependencies
cd backend
npm install

Configure environment variables:

Create a `.env` file in the backend directory:

```env
PORT=3000
HUGGING_FACE_API_KEY=your_api_key_here
```
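
The backend presumably reads these two values at startup. A minimal sketch of validating them follows. `loadConfig` is a hypothetical helper, and it assumes the `.env` values have already been loaded into `process.env` (for example by a package such as `dotenv`, which this README does not confirm is used).

```javascript
// Hypothetical config loader (not the project's actual code). It only
// validates values that are already present in the environment.
function loadConfig(env = process.env) {
  // Default to 3000, matching the example .env file.
  const port = Number(env.PORT ?? "3000");
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`PORT must be an integer between 1 and 65535, got "${env.PORT}"`);
  }

  // Reject an empty key or the unedited placeholder from the example.
  const apiKey = (env.HUGGING_FACE_API_KEY ?? "").trim();
  if (apiKey === "" || apiKey === "your_api_key_here") {
    throw new Error("HUGGING_FACE_API_KEY is missing or still set to the placeholder");
  }

  return { port, apiKey };
}
```

Failing fast here means a missing or placeholder API key is reported when the server starts instead of on the first analysis request.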

Load the extension in Chrome:

Open Chrome and navigate to chrome://extensions
Enable "Developer mode"
Click "Load unpacked"
Select the repository root (the directory that contains manifest.json)

Development

Backend Development

cd backend
npm run dev

Extension Development

Make changes to the extension code
Reload the extension in Chrome

Usage

Click the extension icon in Chrome
Click "Capture Screen" to capture the current tab
Optional: Add specific instructions in the text input
Wait for processing and view results
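
For readers curious what happens behind these steps, the popup side of the flow could look roughly like the sketch below. It is illustrative only: `captureAndAnalyze`, the `/analyze` route, and the JSON field names are placeholders, not the project's confirmed API. `tabsApi` is assumed to behave like `chrome.tabs` in a Manifest V3 extension, where `captureVisibleTab` returns a promise that resolves to a data URL.

```javascript
// Hypothetical popup-side helper. Dependencies are passed in so the
// function can be exercised outside the browser.
async function captureAndAnalyze({ tabsApi, fetchFn, backendUrl, instructions = "", quality = 80 }) {
  // Clamp quality to the 0-100 range used for JPEG captures.
  const jpegQuality = Math.min(100, Math.max(0, Math.round(quality)));

  // Passing null as the window ID targets the current window.
  const image = await tabsApi.captureVisibleTab(null, { format: "jpeg", quality: jpegQuality });

  // Send the capture and the optional instructions to the backend.
  const response = await fetchFn(`${backendUrl}/analyze`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ image, instructions: instructions.trim() }),
  });
  if (!response.ok) {
    throw new Error(`Backend responded with status ${response.status}`);
  }
  return response.json();
}
```

In the extension this might be called as `captureAndAnalyze({ tabsApi: chrome.tabs, fetchFn: fetch, backendUrl: "http://localhost:3000" })`, assuming the backend listens on the port from the example `.env` file and the manifest grants a permission such as `activeTab`.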

Tech Stack

Frontend:
- HTML/CSS/JavaScript
- Chrome Extension APIs
- Modern UI components
Backend:
- Node.js/Express
- Tesseract.js for OCR
- Hugging Face Inference API
- File type validation
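
One common way to implement file type validation is to check the leading "magic" bytes of the upload instead of trusting the declared type. The sketch below shows that idea for PNG and JPEG. `detectImageType` and `decodeImageDataUrl` are hypothetical helpers, and the project's own validation may work differently.

```javascript
// Hypothetical validator: known file signatures for the two formats
// a tab capture is likely to produce.
const SIGNATURES = [
  { mime: "image/png", bytes: [0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a] },
  { mime: "image/jpeg", bytes: [0xff, 0xd8, 0xff] },
];

// Return the detected MIME type, or null for unsupported or truncated input.
function detectImageType(buffer) {
  for (const { mime, bytes } of SIGNATURES) {
    if (buffer.length >= bytes.length && bytes.every((b, i) => buffer[i] === b)) {
      return mime;
    }
  }
  return null;
}

// Decode a base64 data URL and confirm the bytes really are PNG or JPEG.
function decodeImageDataUrl(dataUrl) {
  const match = /^data:image\/(?:png|jpeg);base64,([A-Za-z0-9+/=]+)$/.exec(dataUrl);
  if (!match) {
    throw new Error("Expected a base64 PNG or JPEG data URL");
  }
  const buffer = Buffer.from(match[1], "base64");
  const mime = detectImageType(buffer);
  if (!mime) {
    throw new Error("Image bytes do not match a PNG or JPEG signature");
  }
  return { mime, buffer };
}
```

Checking the bytes rather than the declared type keeps a mislabeled or corrupted upload from reaching the OCR step.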

Contributing

Fork the repository
Create your feature branch
Commit your changes
Push to the branch
Create a Pull Request

License

MIT License

Author

Prathamesh Kusalkar

Acknowledgments

Tesseract.js team
Hugging Face team
Chrome Extensions documentation
