From a52fb7a9d94402cd7ad0fa8132631f223cea86dc Mon Sep 17 00:00:00 2001 From: Visa Doc Translator Date: Fri, 20 Feb 2026 16:25:23 +0800 Subject: [PATCH] feat(skills): add visa-doc-translate skill Add automated visa document translation skill that: - Supports multiple OCR methods (macOS Vision, EasyOCR, Tesseract) - Translates documents to professional English - Generates bilingual PDFs (original + translation) - Handles various document types (bank statements, employment certificates, etc.) - Perfect for visa applications to Australia, USA, Canada, UK, EU Tested with retirement certificates and other official documents. --- skills/visa-doc-translate/README.md | 86 ++++++++++++++++++++ skills/visa-doc-translate/SKILL.md | 117 ++++++++++++++++++++++++++++ 2 files changed, 203 insertions(+) create mode 100644 skills/visa-doc-translate/README.md create mode 100644 skills/visa-doc-translate/SKILL.md diff --git a/skills/visa-doc-translate/README.md b/skills/visa-doc-translate/README.md new file mode 100644 index 00000000..4faa35e8 --- /dev/null +++ b/skills/visa-doc-translate/README.md @@ -0,0 +1,86 @@ +# Visa Document Translator + +Automatically translate visa application documents from images to professional English PDFs. + +## Features + +- πŸ”„ **Automatic OCR**: Tries multiple OCR methods (macOS Vision, EasyOCR, Tesseract) +- πŸ“„ **Bilingual PDF**: Original image + professional English translation +- 🌍 **Multi-language**: Supports Chinese, and other languages +- πŸ“‹ **Professional Format**: Suitable for official visa applications +- πŸš€ **Fully Automated**: No manual intervention required + +## Supported Documents + +- Bank deposit certificates (存款证明) +- Employment certificates (在职证明) +- Retirement certificates (ι€€δΌ‘θ―ζ˜Ž) +- Income certificates (ζ”Άε…₯证明) +- Property certificates (房产证明) +- Business licenses (θ₯δΈšζ‰§η…§) +- ID cards and passports + +## Usage + +```bash +/visa-doc-translate +``` + +### Examples + +```bash +/visa-doc-translate RetirementCertificate.PNG +/visa-doc-translate BankStatement.HEIC +/visa-doc-translate EmploymentLetter.jpg +``` + +## Output + +Creates `_Translated.pdf` with: +- **Page 1**: Original document image (centered, A4 size) +- **Page 2**: Professional English translation + +## Requirements + +### Python Libraries +```bash +pip install pillow reportlab +``` + +### OCR (one of the following) + +**macOS (recommended)**: +```bash +pip install pyobjc-framework-Vision pyobjc-framework-Quartz +``` + +**Cross-platform**: +```bash +pip install easyocr +``` + +**Tesseract**: +```bash +brew install tesseract tesseract-lang +pip install pytesseract +``` + +## How It Works + +1. Converts HEIC to PNG if needed +2. Checks and applies EXIF rotation +3. Extracts text using available OCR method +4. Translates to professional English +5. Generates bilingual PDF + +## Perfect For + +- πŸ‡¦πŸ‡Ί Australia visa applications +- πŸ‡ΊπŸ‡Έ USA visa applications +- πŸ‡¨πŸ‡¦ Canada visa applications +- πŸ‡¬πŸ‡§ UK visa applications +- πŸ‡ͺπŸ‡Ί EU visa applications + +## License + +MIT diff --git a/skills/visa-doc-translate/SKILL.md b/skills/visa-doc-translate/SKILL.md new file mode 100644 index 00000000..394a8359 --- /dev/null +++ b/skills/visa-doc-translate/SKILL.md @@ -0,0 +1,117 @@ +--- +name: visa-doc-translate +description: Translate visa application documents (images) to English and create a bilingual PDF with original and translation +--- + +You are helping translate visa application documents for visa applications. + +## Instructions + +When the user provides an image file path, AUTOMATICALLY execute the following steps WITHOUT asking for confirmation: + +1. **Image Conversion**: If the file is HEIC, convert it to PNG using `sips -s format png --out ` + +2. **Image Rotation**: + - Check EXIF orientation data + - Automatically rotate the image based on EXIF data + - If EXIF orientation is 6, rotate 90 degrees counterclockwise + - Apply additional rotation as needed (test 180 degrees if document appears upside down) + +3. **OCR Text Extraction**: + - Try multiple OCR methods automatically: + - macOS Vision framework (preferred for macOS) + - EasyOCR (cross-platform, no tesseract required) + - Tesseract OCR (if available) + - Extract all text information from the document + - Identify document type (deposit certificate, employment certificate, retirement certificate, etc.) + +4. **Translation**: + - Translate all text content to English professionally + - Maintain the original document structure and format + - Use professional terminology appropriate for visa applications + - Keep proper names in original language with English in parentheses + - For Chinese names, use pinyin format (e.g., WU Zhengye) + - Preserve all numbers, dates, and amounts accurately + +5. **PDF Generation**: + - Create a Python script using PIL and reportlab libraries + - Page 1: Display the rotated original image, centered and scaled to fit A4 page + - Page 2: Display the English translation with proper formatting: + - Title centered and bold + - Content left-aligned with appropriate spacing + - Professional layout suitable for official documents + - Add a note at the bottom: "This is a certified English translation of the original document" + - Execute the script to generate the PDF + +6. **Output**: Create a PDF file named `_Translated.pdf` in the same directory + +## Supported Documents + +- Bank deposit certificates (存款证明) +- Income certificates (ζ”Άε…₯证明) +- Employment certificates (在职证明) +- Retirement certificates (ι€€δΌ‘θ―ζ˜Ž) +- Property certificates (房产证明) +- Business licenses (θ₯δΈšζ‰§η…§) +- ID cards and passports +- Other official documents + +## Technical Implementation + +### OCR Methods (tried in order) + +1. **macOS Vision Framework** (macOS only): + ```python + import Vision + from Foundation import NSURL + ``` + +2. **EasyOCR** (cross-platform): + ```bash + pip install easyocr + ``` + +3. **Tesseract OCR** (if available): + ```bash + brew install tesseract tesseract-lang + pip install pytesseract + ``` + +### Required Python Libraries + +```bash +pip install pillow reportlab +``` + +For macOS Vision framework: +```bash +pip install pyobjc-framework-Vision pyobjc-framework-Quartz +``` + +## Important Guidelines + +- DO NOT ask for user confirmation at each step +- Automatically determine the best rotation angle +- Try multiple OCR methods if one fails +- Ensure all numbers, dates, and amounts are accurately translated +- Use clean, professional formatting +- Complete the entire process and report the final PDF location + +## Example Usage + +```bash +/visa-doc-translate RetirementCertificate.PNG +/visa-doc-translate BankStatement.HEIC +/visa-doc-translate EmploymentLetter.jpg +``` + +## Output Example + +The skill will: +1. Extract text using available OCR method +2. Translate to professional English +3. Generate `_Translated.pdf` with: + - Page 1: Original document image + - Page 2: Professional English translation + +Perfect for visa applications to Australia, USA, Canada, UK, and other countries requiring translated documents.