I’ve been building two productivity tools using Python: a Voice-to-Text converter (using speech_recognition + Google API) and a Screen OCR tool (using mss + pytesseract). Both are currently working well for English, but I’m hitting some architectural challenges when trying to expand them to support multiple languages (e.g., Chinese, Spanish, Japanese).