Hello,
For my activist publishing, I am doing a lot of post-#
OCR formatting manually. One of the most tedious tasks is to join several lines back into one paragraph, so they can be formatted (or machine-translated) properly.
Right now, I have 2022 lines of text to be processed like that, using my favourite #
Gedit.
What could help? Possibly a #
plugin, doing few simple actions:
-- grab the text manually selected,
-- replace all non-hyphenated ends of line with space
-- remove all hanging hyphens and their EOLs.
If anyone knows such a plugin, or perhaps another text editor with such functionality, or maybe would like to volunteer to write such a piece of code, please let me know.