> If AI can diminish some of the monotony of research, perhaps we can spend more time thinking, writing, playing piano, and taking walks — with other people.
Whenever any progress is made, this is the logical conclusion. And yet those who decide how your time is used hold the opposite view.
coolness today at 9:15 AM
Great post and amazing progress in this field! However, I have to wonder if some of these letters were part of the training data for Gemini, since they are well-known and someone has probably already done the painstaking work of transcribing them...
sph today at 12:29 PM
Any self-hosted open source solution? I would like to digitize my paper notebooks but I do not want to use anything proprietary or that uses external services. What is the state of the art on the FOSS side?
Ideally something that I can train with my own handwriting. I had a look at Tesseract, wondering if there’s anything better out there.
macleginn today at 10:43 AM
I became convinced of this after the release of KuroNet: https://arxiv.org/pdf/1910.09433 (High-quality OCR of Japanese manuscripts, which look almost impossible to read.)
pjmlp today at 9:44 AM
Maybe for English. For the other human languages I use, it is still hit and miss, just like speech recognition, where even in English it is enough to have an accent that differs from the standard TV one.
zkmon today at 1:44 PM
It's painful to see that the beautiful handwriting of the past is now pretty much extinct. For me, a person's handwriting says a lot about them, not just their mind, but their physical state as well.
girvo today at 12:31 PM
> "transmitted": In the second line of the body, the word "transmitted" is crossed out in the original text
Am I nuts or is this wrong, not “perfect”?
It doesn’t look crossed out at all to me in the image, just some bleeding?
Still very impressive, of course
__alexs today at 9:44 AM
Call me when it can do Russian cursive.
DarkNova6 today at 10:27 AM
> Here’s Transkribus’s best guess at George’s letter to Maryann, above:
Transkribus has a new model architecture around the corner and the results look impressive. Not only for trivial cases like text, but also for table structures and layout.
Best of all, you can train it on your own corpus of text to support obscure languages and handwriting systems.
Really looking forward to it.
ferguess_k today at 1:58 PM
Don't worry, handwriting itself has diminished over the decades since the introduction of computers and especially smartphones.
Ah, maybe I'll pick up Qin seal when I retire, if I retire.
tigerlily today at 10:21 AM
Surely the true prize is to be able to ditch computers altogether and just write with pencil on paper.
iamflimflam1 today at 9:36 AM
If I went back in time to the 90s, when I was doing my PhD, I would absolutely blow my mind with how well handwriting OCR works now.
th0ma5 today at 9:37 AM
My question for OCR automation is always which digits within the numbers being read are allowed to be incorrect?
lifestyleguru today at 11:59 AM
It feels unbelievable that the literacy rate in Europe could have been 10% or lower. Then I look at documents even as young as 150 years... fraktur, blackletter, elaborate handwriting. I guess I'm illiterate now.
Hopefully next generations will feel the same about legal contracts, law in general, and Java code bases. They're incomprehensible not because of fonts but because of unfathomable complexity.
nikanj today at 10:54 AM
The writing is on the wall for handwriting. Zoomers use speech recognition or touchscreen keyboards, millennials use keyboards. Boomers use pens.