Merge pull request #255 from CindyPersonalGit/main

LGTM — visa-doc-translate skill. On-device OCR only, no external calls.
Affaan Mustafa
2026-02-24 09:24:19 -08:00
committed by GitHub
2 changed files with 203 additions and 0 deletions

@@ -0,0 +1,86 @@
# Visa Document Translator
Automatically translate visa application documents from images to professional English PDFs.
## Features
- 🔄 **Automatic OCR**: Tries multiple OCR methods (macOS Vision, EasyOCR, Tesseract)
- 📄 **Bilingual PDF**: Original image + professional English translation
- 🌍 **Multi-language**: Supports Chinese and other languages
- 📋 **Professional Format**: Suitable for official visa applications
- 🚀 **Fully Automated**: No manual intervention required
## Supported Documents
- Bank deposit certificates (存款证明)
- Employment certificates (在职证明)
- Retirement certificates (退休证明)
- Income certificates (收入证明)
- Property certificates (房产证明)
- Business licenses (营业执照)
- ID cards and passports
## Usage
```bash
/visa-doc-translate <image-file>
```
### Examples
```bash
/visa-doc-translate RetirementCertificate.PNG
/visa-doc-translate BankStatement.HEIC
/visa-doc-translate EmploymentLetter.jpg
```
## Output
Creates `<filename>_Translated.pdf` with:
- **Page 1**: Original document image (centered, A4 size)
- **Page 2**: Professional English translation
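
Centering and scaling the image on page 1 comes down to a little arithmetic. The sketch below is one way to compute the placement, assuming an A4 page measured in PDF points (210 mm by 297 mm at 72 points per inch). The function name `fit_on_a4` and the `margin` parameter are illustrative and not part of the skill.

```python
# Illustrative helper (not part of the skill): place an image on an A4 page.
MM_TO_PT = 72 / 25.4          # PDF points per millimetre
A4_WIDTH = 210 * MM_TO_PT     # about 595.28 pt
A4_HEIGHT = 297 * MM_TO_PT    # about 841.89 pt


def fit_on_a4(img_width, img_height, margin=36.0):
    """Return (x, y, width, height) in points for a centered, scaled image.

    The aspect ratio is preserved. `margin` (an assumed default of 36 pt,
    half an inch) is kept free on every side. Small images are scaled up
    as well as down, so the result always touches two opposite margins.
    """
    if img_width <= 0 or img_height <= 0:
        raise ValueError("image dimensions must be positive")
    if margin < 0 or 2 * margin >= A4_WIDTH:
        raise ValueError("margin must be non-negative and leave room for the image")
    max_w = A4_WIDTH - 2 * margin
    max_h = A4_HEIGHT - 2 * margin
    scale = min(max_w / img_width, max_h / img_height)
    width = img_width * scale
    height = img_height * scale
    # PDF coordinates start at the bottom-left corner of the page.
    x = (A4_WIDTH - width) / 2
    y = (A4_HEIGHT - height) / 2
    return x, y, width, height
```

The returned values map directly onto reportlab's `Canvas.drawImage(path, x, y, width=..., height=...)`, which also measures from the bottom-left corner.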
## Requirements
### Python Libraries
```bash
pip install pillow reportlab
```
### OCR (one of the following)
**macOS (recommended)**:
```bash
pip install pyobjc-framework-Vision pyobjc-framework-Quartz
```
**Cross-platform**:
```bash
pip install easyocr
```
**Tesseract**:
```bash
brew install tesseract tesseract-lang
pip install pytesseract
```
## How It Works
1. Converts HEIC to PNG if needed
2. Checks and applies EXIF rotation
3. Extracts text using available OCR method
4. Translates to professional English
5. Generates bilingual PDF
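
Step 1 can be sketched as follows. The helper builds the `sips` invocation that the skill definition specifies for HEIC input (`sips -s format png <input> --out <output>`). Since `sips` ships with macOS, the sketch assumes a macOS host; the helper name and the choice to write the PNG next to the source file are assumptions for illustration.

```python
from pathlib import Path


def heic_to_png_command(image_path):
    """Return the sips argument list for a HEIC file, or None otherwise.

    Hypothetical helper: only the sips command line itself comes from the
    skill definition. The PNG path is the source path with a .png suffix.
    """
    src = Path(image_path)
    if src.suffix.lower() != ".heic":
        return None  # not HEIC, so no conversion is requested
    dst = src.with_suffix(".png")
    return ["sips", "-s", "format", "png", str(src), "--out", str(dst)]
```

On macOS the returned list could be passed to `subprocess.run(cmd, check=True)`; a return value of `None` means the file is used as-is.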
## Perfect For
- 🇦🇺 Australia visa applications
- 🇺🇸 USA visa applications
- 🇨🇦 Canada visa applications
- 🇬🇧 UK visa applications
- 🇪🇺 EU visa applications
## License
MIT

@@ -0,0 +1,117 @@
---
name: visa-doc-translate
description: Translate visa application documents (images) to English and create a bilingual PDF with original and translation
---
You are helping translate documents for visa applications.
## Instructions
When the user provides an image file path, AUTOMATICALLY execute the following steps WITHOUT asking for confirmation:
1. **Image Conversion**: If the file is HEIC, convert it to PNG using `sips -s format png <input> --out <output>`
2. **Image Rotation**:
   - Check EXIF orientation data
   - Automatically rotate the image based on EXIF data
   - If EXIF orientation is 6, rotate 90 degrees clockwise
   - Apply additional rotation as needed (test 180 degrees if document appears upside down)
3. **OCR Text Extraction**:
   - Try multiple OCR methods automatically:
     - macOS Vision framework (preferred for macOS)
     - EasyOCR (cross-platform, no tesseract required)
     - Tesseract OCR (if available)
   - Extract all text information from the document
   - Identify document type (deposit certificate, employment certificate, retirement certificate, etc.)
4. **Translation**:
   - Translate all text content to English professionally
   - Maintain the original document structure and format
   - Use professional terminology appropriate for visa applications
   - Keep proper names in original language with English in parentheses
   - For Chinese names, use pinyin format (e.g., WU Zhengye)
   - Preserve all numbers, dates, and amounts accurately
5. **PDF Generation**:
   - Create a Python script using PIL and reportlab libraries
   - Page 1: Display the rotated original image, centered and scaled to fit A4 page
   - Page 2: Display the English translation with proper formatting:
     - Title centered and bold
     - Content left-aligned with appropriate spacing
     - Professional layout suitable for official documents
   - Add a note at the bottom: "This is a certified English translation of the original document"
   - Execute the script to generate the PDF
6. **Output**: Create a PDF file named `<original_filename>_Translated.pdf` in the same directory
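
The rotation step can be sketched as a lookup. Under the EXIF convention, orientation 3 means the stored pixels need a 180 degree turn, 6 means a 90 degree clockwise turn, and 8 means a 270 degree clockwise turn to display upright. The table below covers only these pure rotations and converts them to the counterclockwise angle that Pillow's `Image.rotate` expects. The names are illustrative, and the mirrored orientations (2, 4, 5, 7) are deliberately left out.

```python
# Clockwise rotation (degrees) needed to display the image upright,
# keyed by EXIF orientation value. Mirrored values (2, 4, 5, 7) are omitted.
EXIF_CLOCKWISE_DEGREES = {1: 0, 3: 180, 6: 90, 8: 270}


def pillow_rotate_angle(orientation):
    """Return the angle to pass to Pillow's Image.rotate for an EXIF value.

    Image.rotate turns counterclockwise, so the clockwise correction is
    converted. Unknown, missing, or mirrored values fall back to 0.
    """
    clockwise = EXIF_CLOCKWISE_DEGREES.get(orientation, 0)
    return (360 - clockwise) % 360
```

With Pillow installed, this would be used as `image.rotate(pillow_rotate_angle(6), expand=True)`. Pillow's `ImageOps.exif_transpose(image)` applies the full EXIF table, including the mirrored cases, in a single call and may be the simpler choice.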
## Supported Documents
- Bank deposit certificates (存款证明)
- Income certificates (收入证明)
- Employment certificates (在职证明)
- Retirement certificates (退休证明)
- Property certificates (房产证明)
- Business licenses (营业执照)
- ID cards and passports
- Other official documents
## Technical Implementation
### OCR Methods (tried in order)
1. **macOS Vision Framework** (macOS only):
```python
import Vision
from Foundation import NSURL
```
2. **EasyOCR** (cross-platform):
```bash
pip install easyocr
```
3. **Tesseract OCR** (if available):
```bash
brew install tesseract tesseract-lang
pip install pytesseract
```
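
One way to decide which of these methods can be tried is to check which Python modules are importable, in the order listed above. This is a minimal sketch, not the skill's actual selection logic: the labels and the function name are hypothetical, and `Vision` is assumed to be the module name installed by `pyobjc-framework-Vision`.

```python
import importlib.util

# (label, import name) pairs in the order the OCR methods are listed.
# The labels are made up for this sketch.
OCR_BACKENDS = [
    ("macos-vision", "Vision"),
    ("easyocr", "easyocr"),
    ("tesseract", "pytesseract"),
]


def available_ocr_backends(find_spec=importlib.util.find_spec):
    """Return the labels of OCR backends whose Python module is installed.

    Only importability is checked. pytesseract additionally needs the
    tesseract binary on PATH, which this sketch does not verify.
    """
    found = []
    for label, module_name in OCR_BACKENDS:
        try:
            if find_spec(module_name) is not None:
                found.append(label)
        except (ImportError, ValueError):
            # find_spec can raise for broken or half-installed packages
            continue
    return found
```

The first label in the returned list is the preferred method; the remaining ones serve as fallbacks if extraction with it fails.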
### Required Python Libraries
```bash
pip install pillow reportlab
```
For macOS Vision framework:
```bash
pip install pyobjc-framework-Vision pyobjc-framework-Quartz
```
## Important Guidelines
- DO NOT ask for user confirmation at each step
- Automatically determine the best rotation angle
- Try multiple OCR methods if one fails
- Ensure all numbers, dates, and amounts are carried over accurately
- Use clean, professional formatting
- Complete the entire process and report the final PDF location
## Example Usage
```bash
/visa-doc-translate RetirementCertificate.PNG
/visa-doc-translate BankStatement.HEIC
/visa-doc-translate EmploymentLetter.jpg
```
## Output Example
The skill will:
1. Extract text using available OCR method
2. Translate to professional English
3. Generate `<filename>_Translated.pdf` with:
   - Page 1: Original document image
   - Page 2: Professional English translation
Perfect for visa applications to Australia, USA, Canada, UK, and other countries requiring translated documents.
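
The output name can be derived from the input path with `pathlib`. The sketch below assumes that `<filename>` means the name without its extension and that the PDF is written to the same directory as the image; the function name is illustrative.

```python
from pathlib import Path


def translated_pdf_path(image_path):
    """Return the PDF path: same directory, stem plus '_Translated.pdf'.

    Assumption: the original extension (.PNG, .HEIC, .jpg) is dropped.
    """
    src = Path(image_path)
    return src.with_name(src.stem + "_Translated.pdf")
```

For the example invocations, `translated_pdf_path("RetirementCertificate.PNG")` gives a path named `RetirementCertificate_Translated.pdf`.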