Removing text between tags Thread poster: Brian O Callaghan
|
Hi there, I have a file with tags between every word in long paragraphs, I think it's an OCR'ed document which might explain all the tags. I don't mind having all the tags at the end of each translation unit, so I'm looking for a way to isolate the tags only or the text only, and to remove the text. I hope this makes sense, I'd really appreciate if anyone had any ideas on this. I've come across this issue many times of course, but never as bad as this! Thanks, Brian | | | Natalie Poland Local time: 09:13 Member (2002) English to Russian + ... Moderator of this forum SITE LOCALIZER | Samuel Murray Netherlands Local time: 09:13 Member (2006) English to Afrikaans + ...
Yes. This is a blog post by Emma Goldsmith. She essentially says "go back to Word and fix it there". Her solution works if (a) you have access to the original Word file and (b) you are allowed to create a new SDLXLIFF file. Do you have access to the original Word file and are you allowed to create a new SDLXLIFF file, Brian? You may also find a number of other threads here -- just enter, for example, "tag soup" in the search field and don't forget to tick the 'exact match' option. I find that searches for "tag soup" only gets me solutions about how to fix the problem in Word. I could not find any solution that is similar to Brian's suggestion, i.e. an easy way to shift all tags to the end of the segment. This may also have to include a method to strip the active segment's target text from all tags. Do you know of such a method, Natalie? I have encountered segments in which I wanted to tag the target text in a segment from scratch, but could not find an easy way to strip all tags in one go, so I then I had to select the tags manually, one by one, to delete them. Does anyone know of a way to delete all tags from the active segments target text, and a way to insert all tags at once, at the cursor position? | | | Natalie Poland Local time: 09:13 Member (2002) English to Russian + ... Moderator of this forum SITE LOCALIZER In general... | May 19, 2019 |
Samuel Murray wrote: I find that searches for "tag soup" only gets me solutions about how to fix the problem in Word. I could not find any solution that is similar to Brian's suggestion, i.e. an easy way to shift all tags to the end of the segment. This may also have to include a method to strip the active segment's target text from all tags. Do you know of such a method, Natalie? ...working with tons of redundant tags makes no sense, irrespectively of the source file format. However, there is a method (suggested here in the forum by somebody a few years ago) that allows you to do what you want, however, in a different way: not by moving the tags, but by removing all text and leaving just the tags. This allows you to start typing your translation where you want, before the tags, between or after them. 1) Open the sdlxliff in the editor 2) Copy all source to target, and place the cursor in the first position of the first segment 3) Go to Find&Replace In the 'Find what' field type '.' (just a fullstop, without quotation marks) In the 'Replace with' field do not type anything, just leave it empty In the 'Find options' tick the 'Use' option, and choose 'Regular expressions' Click 'Find next' and then 'Replace all' The target column will be stripped of all text and only the tags will be left. | |
|
|
Yep, find and replace | May 20, 2019 |
Samuel and Natalie, thanks for your replies. After a night thinking about it, this is exactly what I did, although I had to go through every letter, number and a good few special characters, I wasn't aware of the shortcut suggested in the previous forum post. This is a great way to handle enormous numbers of tags, for sure, just avoid doing it on a laptop as it runs quite slow in large documents! All the best, Brian | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Removing text between tags CafeTran Espresso | You've never met a CAT tool this clever!
Translate faster & easier, using a sophisticated CAT tool built by a translator / developer.
Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools.
Download and start using CafeTran Espresso -- for free
Buy now! » |
| Anycount & Translation Office 3000 | Translation Office 3000
Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |