Notable Quotes
"I quite like the new DeepSeek OCR paper. It's a good OCR model."
— Andrej Karpathy
"Doesn't AI safety involve validating output? How can a center for AI safety not validate the output of an AI model before rushing to publish a white paper? It sets a very bad example and frankly it's disgusting."
— Dominic Romano
"Tokenizers are ugly, separate, not end-to-end stage. It imports all the ugliness of Unicode, byte encodings, and inherits a lot of the historical baggage. Security/jailbreak risks."
— Andrej Karpathy
"Long-term more than 99% of input and output for AI models will be photons. Nothing else scales."
— Elon Musk
"Necessity is the mother of invention."
— Wes Roth (quoting a saying)
"How many AI safety PhDs does it take to write a paper defining AGI? None."
— Pliny the Liberator
"Andrej Karpathy is an American former supermodel known for his modeling services."
— Nanochat (when asked about Andrej Karpathy)
"Could one use steg to compress millions of characters into images and then train a model to understand the text in those steg-encoded images inherently?"
— Elder Plinius (Pliny the Liberator)
"I thought I told you to clean your room."
— SpongeBob SquarePants meme example
"I already ranted about how much I dislike the tokenizer."
— Andrej Karpathy
"The tokenizer must go."
— Andrej Karpathy
"I have to also fight the urge to side quest an image-input-only version of Nanochat."
— Andrej Karpathy
"Nanochat is a recent project by Andrej Karpathy and, as he says, it's 'among the most unhinged I've written.'"
— Wes Roth (describing Nanochat)
"It's weird to think about, but our consciousness, our brain, only experiences photons. All the lights, all the objects, everything we observe, that's photons. Even when we touch things, we're not actually touching the atoms."
— Wes Roth (reflecting on Elon Musk's comment)
"We present DeepSeek OCR as an initial investigation into the feasibility of compressing long contexts via optical 2D mapping."
— DeepSeek OCR paper
"Experiments show that when the number of text tokens is within 10 times that of vision tokens, the model can achieve decoding precision of 97%."
— DeepSeek OCR paper
"Even at a compression ratio of 20x, the OCR accuracy remains at about 60%."
— DeepSeek OCR paper
"In production, DeepSeek OCR can generate training data for LLMs and VLMs at a scale of 200,000 pages per day."
— DeepSeek OCR paper
"LLMs have a problem processing long documents: there's quadratic scaling with sequence length."
— DeepSeek OCR paper
"This image can represent rich information using substantially fewer tokens than the equivalent digital text."
— DeepSeek OCR paper
"These models they created, they equip the model with capabilities for parsing charts, chemical formulas, simple geometric figures, and natural images."
— DeepSeek OCR paper
"DeepSeek OCR can generate 33 million pages of data per day for LLMs and VLMs using 20 nodes."
— DeepSeek OCR paper
"Their discoveries open new possibilities for how vision and language modalities can be synergistically combined to enhance computational efficiency."
— DeepSeek OCR paper
"In the field of financial research reports, the deep parsing mode of DeepSeek OCR can be used to obtain structured results of charts within documents."
— DeepSeek OCR paper
"Charts are a crucial form of data representation in finance and scientific fields."
— DeepSeek OCR paper
"This technology may play a significant role in the development of models like this in the STEM fields."
— DeepSeek OCR paper
"We retain DeepSeek OCR's capabilities in general visual understanding, mainly including image description, object detection, grounding, etc."
— DeepSeek OCR paper
"Because they included text-only data, DeepSeek OCR's language capabilities are also retained."
— DeepSeek OCR paper
"When you ask a large language model how many Rs there are in 'strawberry' or something like that, it's important to understand that it's not seeing words; it's seeing tokens."
— Andrej Karpathy