what 262,715 regex questions on stack overflow haven't answered (part 2)

18 points by abareplace

0x2ba22e11

Those three diagrams with Sisyphus and Icarus made me laugh.

It's a little unfortunate that that one StackOverflow answer about using regexes to parse HTML is so widespread when the actually textbook correct answer that doesn't require berating strangers on StackOverflow is just "you can't parse HTML with regex but you can lex it with regexes and feed tokens into a parser". The academic tradition for writing programming language parsers is after all to use lex (which is regex based) for tokenisation and yacc or one of its descendants (typically LR family) for parsing.

Reading this illiterate style without proper capitalization is so tiring

addison

I'll admit I'm also not fond of it, because I tend not to see periods and thus rely on capitals. Still, author's presentation of discretion, can still be read.
abareplace

You have to prove that you are not an LLM these days, that's why people write without capitalization.
- addison
  
  I don't believe that this is the case at all. It has always come across as a stylistic choice and I saw it quite regularly before slop appeared.
- kenballus
  
  I go back and forth on using it because I know it bothers some people, but I think monospace-lowercase is pretty slick. This opinion has nothing to do with LLMs.
WilhelmVonWeiner

You can always pipe it through sed and fix it yourself (tutorial)
- novalis
  
  I sort of assumed this was a joke, i.e. That you were saying that cases like e.g. The one in the middle of this sentence were going to trip you up, and thus maybe a regex wasn't going to help.
  - WilhelmVonWeiner
    
    Better an imperfect solution than a perfect daydream of one