Text Extraction from Multiple Formats:
Our tool extracts text from PDFs, images, Word, Excel, PowerPoint, and even audio files (MP3/WAV) using advanced Tesseract OCR and Google Speech Recognition. This makes it easy to process scanned documents, images, or speech in multiple languages, including Arabic and English.
Summarization:
Using Meta’s LLaMA3.2 model, the system generates clear and concise summaries in English or Arabic, perfect for quickly grasping the key points of long documents.
Question Generation:
Not only can we summarize, but we also generate relevant questions and answers from the text, making it ideal for educators, students, and corporate trainers.
? How It Works:
Upload your document (PDF, image, audio, etc.).
The tool extracts text via OCR or speech-to-text.
Then, it summarizes the text and generates comprehension questions with answers.
? Tech Stack:
Tesseract OCR for text extraction.
Meta’s LLaMA3.2 model for summarization and question generation.
Google Speech API for audio transcription.