User talk:Content
July 2021
[edit]Hello, and welcome to Wikipedia. This is a message letting you know that one or more of your recent edits to Philotimo have been undone by an automated computer program called ClueBot NG.
- ClueBot NG makes very few mistakes, but it does happen. If you believe the change you made was constructive, please read about it, report it here, remove this message from your talk page, and then make the edit again.
- For help, take a look at the introduction.
- The following is the log entry regarding this message: Philotimo was changed by Content (u) (t) ANN scored at 1 on 2021-07-09T21:02:26+00:00
Thank you. ClueBot NG (talk) 21:02, 9 July 2021 (UTC)
Missing {
[edit]Hey, thanks for doing that work a big help. They don't show up in tracking categories hard to find. -- GreenC 05:10, 16 July 2021 (UTC)
- @GreenC Easy to find: [1], but looks like nobody does it. :) Content (talk) 13:30, 16 July 2021 (UTC)
That includes many false positives? I actually did write a program that, by accident, gave citations missing a closing '}'. So a few times I cleaned them up but not for a while as I am too busy. And it was only cite news|journal|magazine|book not web. Here are the 50 or so current yesterday.
Extended content
|
---|
|
It's possible a few of these are OK but contain more than 10 or so embedded templates, templates within templates, which caused the program to list (it was not designed for this purpose). -- GreenC 16:22, 16 July 2021 (UTC)
- @GreenC You looking for missing closing '}' only? There is also missing opening '{', missing '[' or ']' inside parameters in template, and...... look at this: http://en.wiki.x.io/w/index.php?title=Daisy_Tapley&diff=prev&oldid=1033908235
- I think, this way to search is good for manual editing. There is not many false positives in it. Content (talk) 16:39, 16 July 2021 (UTC)
- Oh for sure, this was by accident the program has a different purpose. Probably need multiple methods to find the varieties. -- GreenC 16:56, 16 July 2021 (UTC)
- I think, to find them all we need find all 'cite news' inside wiki-text, then find closest '{{' on the left of it and closest '}}' on the right (exactly, may be sequence of '{{' and '}}') and try to parse result to find error in formatting. Looks easy, but not for 6M pages.:) Content (talk) 17:11, 16 July 2021 (UTC)
- Yes sort of what my program does, though it's only checking for trailing }} and logs if it can't find one. It's just trying to parse out all valid citations and logs errors. It processes 6M weekly, using parallel processing on toolforge grid. With more job slots it could do it every 18 hours, but it would use up my quota. -- GreenC 18:51, 16 July 2021 (UTC)
- @GreenC: Wow! What programming language you use? Content (talk) 00:05, 17 July 2021 (UTC)
- You'll probably laugh (I do). Its GNU awk. https://github.com/greencardamom/xcite .. there is Nim module there for the parsing part. -- 00:23, 17 July 2021 (UTC)
- @GreenC: Wow! What programming language you use? Content (talk) 00:05, 17 July 2021 (UTC)
- Yes sort of what my program does, though it's only checking for trailing }} and logs if it can't find one. It's just trying to parse out all valid citations and logs errors. It processes 6M weekly, using parallel processing on toolforge grid. With more job slots it could do it every 18 hours, but it would use up my quota. -- GreenC 18:51, 16 July 2021 (UTC)
- I think, to find them all we need find all 'cite news' inside wiki-text, then find closest '{{' on the left of it and closest '}}' on the right (exactly, may be sequence of '{{' and '}}') and try to parse result to find error in formatting. Looks easy, but not for 6M pages.:) Content (talk) 17:11, 16 July 2021 (UTC)
- Oh for sure, this was by accident the program has a different purpose. Probably need multiple methods to find the varieties. -- GreenC 16:56, 16 July 2021 (UTC)
@GreenC: Nothing to laugh. It's interesting. (I try to write primitive parser of wiki-templates in python right now.) (talk) 00:57, 17 July 2021 (UTC)