It doesn’t allow me to install any Python packages.
I am trying to write a program to read a docx file containing a table with a table header and Unicode characters.
I can convert it into a PDF, HTML, or any file format outside of the Python environment.
However, I need to be able to read and act on the tabular content using only the built-in libraries available in Python. I tried converting the file to HTML and reading it with pandas' read_html, but the lxml package was not installed, so I got stuck.
CSV works with Unicode in principle, but whether the characters survive depends on the application and settings you use for the conversion. That's not a Python question, though, and is off-topic here. The on-topic answer would be to use Python libraries to convert, like pandas with lxml, but then you're back to the original problem.
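For what it's worth, a .docx file is a ZIP archive whose main content lives in `word/document.xml`, so the standard library can in principle read the table directly. Below is a rough sketch of that idea, not a tested solution for your file: the function name `read_docx_tables` is made up for illustration, and it assumes simple tables with no nested tables or merged cells.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def read_docx_tables(path):
    """Return every table in the document as a list of rows of cell text.

    Hypothetical helper. Assumes simple tables: a nested table would also
    be reported as a table of its own and its text repeated in the outer cell.
    """
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    tables = []
    for tbl in root.iter(W + "tbl"):
        rows = []
        for tr in tbl.iter(W + "tr"):
            # A cell's text is spread over one or more <w:t> runs; join them.
            cells = [
                "".join(t.text or "" for t in tc.iter(W + "t"))
                for tc in tr.iter(W + "tc")
            ]
            rows.append(cells)
        tables.append(rows)
    return tables


# Example use (the file name is a placeholder):
# header, *body = read_docx_tables("report.docx")[0]
```

The XML is parsed as UTF-8, so Unicode text comes through as ordinary Python strings. Treating the first row as the header is a convention you would have to apply yourself.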
Great! Thanks, Peter. I have now confirmed that the CSV file contains all the Unicode characters. Can you please suggest how I can write Python code to read the CSV file and process its content?
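The built-in `csv` module should be enough for this. Here is a minimal sketch, assuming the file was saved as UTF-8 and its first row is the table header. The function name `read_table` and the file name in the usage comment are placeholders, not anything from your setup.

```python
import csv


def read_table(path, encoding="utf-8-sig"):
    """Read a CSV file and return (header, rows).

    Hypothetical helper. "utf-8-sig" is an assumption: it reads plain UTF-8
    and also strips the byte-order mark that some exporters prepend.
    """
    # newline="" lets the csv module handle line endings inside quoted cells
    with open(path, newline="", encoding=encoding) as f:
        reader = csv.DictReader(f)  # first row becomes the dict keys
        rows = list(reader)
        return reader.fieldnames, rows


# Example use (the file name is a placeholder):
# header, rows = read_table("table.csv")
# for row in rows:
#     print(row[header[0]])
```

Each row comes back as a dict keyed by the header text, so you can process cells by column name. If the export used a different encoding or a delimiter other than a comma, pass `encoding=...` here or add `delimiter=...` to `csv.DictReader`.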