تفاصيل العمل

This project utilizes Python to extract valuable data from Arabic PDF that contains receipts of a reservations done online and transform it into a structured dataset. Using pdfplumber, the project efficiently extracts text and tabular data, ensuring accurate retrieval of Arabic content. The extracted data is then processed and structured with pandas, allowing for easy manipulation and analysis. Finally, the cleaned dataset is exported to an Excel file, making it accessible for further processing, reporting, or machine learning applications.

Technologies Used:

Python

pdfplumber (for extracting text and tables from PDFs)

pandas (for data processing and structuring)

Excel (as the final output format)

This project is ideal for handling Arabic document digitization, data extraction, and automation of data entry tasks.

the image have been censored because of customer data like phone number and address etc.

بطاقة العمل

اسم المستقل
عدد الإعجابات
0
تاريخ الإضافة
المهارات