OCR API for STEM
Image & PDF digitizing software for STEM companies. Make creating and searching STEM content easy for your end users.
Battle-tested with deep functionality, including math equations, chemical diagrams, tables, and full PDF document conversion.
Compatible with industry standards like PDFs, LaTeX, Asciimath, MathML, Markdown, HTML, and SMILES (for chemistry).
IMPORTANT NOTE: This product requires a good working knowledge of programming. Because the OCR API license is non-refundable, we recommend a Mathpix Pro subscription for users who don't have programming experience.
The most trusted, innovative, and limit-pushing OCR provider for STEM
We are constantly pushing the boundaries of what is possible for image recognition while maintaining extremely low latencies.
Due to our exceptional accuracy and constant innovation, we are the most trusted brand for OCR among leaders in online math education.
We process over 10 million images every day. Our system is battle-tested and reliable, with 99.9%+ uptime.
Global availability enables low latencies
We have 3 AWS server regions to ensure the fastest possible processing speeds for our customers in North/South America (us-east-1), Europe (eu-central-1), and Asia (ap-southeast-1). Requests are routed to the nearest region by default.
All Mathpix OCR API endpoints
Process an image
This is our most widely used endpoint. You can use this endpoint to OCR individual images of handwritten and printed equations, handwritten and printed text, tables, and diagrams to get digital formats like LaTeX, Asciimath, and MathML.
You can also request additional information about the data in the image like which alphabets were detected (we support most foreign languages), line data, word data, and confidence.
This is also the endpoint to use for images of diagrams, such as geometry and chemistry diagrams. At this time we only support triangles for geometry, which are represented via vertices, edges, and labels.
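As a rough illustration of what a call to the image route might look like, here is a minimal Python sketch that builds (but does not send) a request to v3/text. The base URL, the `app_id` / `app_key` header names, and the `src` and `formats` body fields are assumptions that are not stated on this page; please confirm them in the API reference before relying on them.

```python
import json
import urllib.request

# Assumed base URL, not stated on this page: verify in the API reference.
API_BASE = "https://api.mathpix.com"


def build_text_request(image_url, app_id, app_key):
    """Build (but do not send) a POST request for the v3/text route."""
    body = {
        "src": image_url,             # assumed field name for the image location
        "formats": ["text", "data"],  # assumed field name for requested outputs
    }
    return urllib.request.Request(
        url=f"{API_BASE}/v3/text",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "app_id": app_id,    # assumed credential header name
            "app_key": app_key,  # assumed credential header name
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request; building it separately makes it easy to inspect or log the payload first.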
Process strokes (digital ink)
Use this endpoint to add stroke recognition (i.e., digital ink) to your app. This endpoint works for all math, and for text in Hindi and in Latin-alphabet languages. You can process the data for each individual stroke, or you can use a session_id for live drawing capabilities in your app (this is also the more cost-effective option).
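Drawing apps usually capture ink as a list of points per stroke, so some reshaping is typically needed before sending it. The sketch below assumes the strokes route expects parallel x and y arrays (one list per stroke) nested under a `strokes` key, and that the session ID is passed as `strokes_session_id`. Both the payload shape and the field names are assumptions to check against the API reference.

```python
def strokes_to_payload(strokes, session_id=None):
    """Convert [[(x, y), ...], ...] into parallel x/y arrays, one list per stroke."""
    xs = [[x for x, _ in stroke] for stroke in strokes]
    ys = [[y for _, y in stroke] for stroke in strokes]
    # Assumed payload shape; not confirmed by this page.
    payload = {"strokes": {"strokes": {"x": xs, "y": ys}}}
    if session_id is not None:
        # Assumed field name for the live-drawing session ID.
        payload["strokes_session_id"] = session_id
    return payload
```

For example, two strokes `[[(1, 2), (3, 4)], [(5, 6)]]` become `x = [[1, 3], [5]]` and `y = [[2, 4], [6]]`.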
Process a PDF
Use this endpoint to OCR PDFs and convert them to Markdown, LaTeX, and DOCX. This endpoint works asynchronously, since large PDF files can take several minutes to process. This endpoint accepts PDF files and URLs in the request.
Since requests are asynchronous, you can check the processing status of a PDF using the PDF ID returned by the API. Once the PDF has been processed, you can convert it to your desired format. The PDF endpoint can also return line-by-line data, which can be useful for building novel experiences on top of original PDFs. A separate PDF results endpoint (v3/pdf-results) can be used to query information about a certain PDF, or all your PDFs.
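Because PDF processing is asynchronous, client code generally needs a polling loop around the status check. The following is a minimal sketch of one: `fetch_status` is a hypothetical caller-supplied function that takes the PDF ID and returns the decoded JSON status as a dict, and the `"completed"` and `"error"` status values are assumptions rather than values documented on this page.

```python
import time


def wait_for_pdf(pdf_id, fetch_status, poll_seconds=5.0,
                 timeout_seconds=600.0, sleep=time.sleep):
    """Poll until the PDF is processed, has failed, or the timeout elapses.

    fetch_status: hypothetical callable, pdf_id -> dict with a "status" key.
    """
    waited = 0.0
    while True:
        status = fetch_status(pdf_id)
        state = status.get("status")
        if state == "completed":  # assumed terminal success value
            return status
        if state == "error":      # assumed terminal failure value
            raise RuntimeError(f"PDF {pdf_id} failed: {status}")
        if waited >= timeout_seconds:
            raise TimeoutError(f"PDF {pdf_id} not ready after {waited:.0f}s")
        sleep(poll_seconds)
        waited += poll_seconds
```

Injecting `fetch_status` and `sleep` keeps the loop independent of any particular HTTP client and lets it be tested without network access.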
Process a batch of images
Use this endpoint to process multiple images in a single POST request. The request returns a batch ID, which can be queried with a GET request once an appropriate amount of time has passed for the images to process. You should only use the batch endpoint if your workflow is not latency sensitive (for example, you are digitizing a textbook and need to process all images of equations). Please also note that batch requests are priced per image, not per request.
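To make the per-image pricing concrete, here is a small sketch that assembles a batch payload and counts how many images it would be billed for. The `urls` field and the `img-N` keys are hypothetical names chosen for illustration; the actual request format should be taken from the API reference.

```python
def build_batch_payload(image_urls):
    """Map an illustrative per-image key to each URL for one batch request."""
    # "urls" and the "img-N" keys are assumed names, not confirmed by this page.
    return {"urls": {f"img-{i}": url for i, url in enumerate(image_urls, start=1)}}


def billable_images(payload):
    """Batch requests are priced per image, so count images, not requests."""
    return len(payload["urls"])
```

A single batch request containing three image URLs therefore counts as three billable images, not one.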
EdTech industry leaders trust Mathpix
Digitize entire PDFs
PDFs are first converted into Mathpix Markdown, from which they can be exported to DOCX / MS Word, LaTeX, and PDF. Text, diagrams, equations, and tables are extracted from the PDF.
Extracted Mathpix Markdown
Supported image types
Use Mathpix OCR to very accurately convert images of simple and complicated printed and handwritten math, text, tables, and chemical diagrams.
We also recognize the world's most widely spoken languages, including English, Vietnamese, Spanish, French, German, Hindi, Chinese, Japanese, Russian, Korean, Thai, and more.
Multiple choice questions
Printed Foreign Alphabet text
You can find more example requests and types of images that you can process with Mathpix OCR in the User Guide.
Use cases for Mathpix OCR
Solving and Search Apps
With a long tail of advanced math, handwriting, and foreign language features, Mathpix OCR is the #1 choice for solving, tutoring, and search apps like Mathway, Doubtnut, and Toppr. These apps help students of all levels around the world learn math, find tutors, and get the help that they need.
Grading and Assessment Platforms
Whether they provide immediate answers or grade questions in bulk, grading and assessment platforms like Gradescope use Mathpix OCR to make it faster and easier for professors to distribute and grade homework and exams with AI.
Publishing, Accessibility and LMS
Publishing companies use Mathpix OCR to create digital STEM educational material like online math textbooks. Accessible education companies like Benetech use Mathpix to make math and science educational materials accessible to all students.
Ready to get started?
Choose the plan that’s right for you and your team.
Pay As You Go
Pay as you go API pricing by route
We recommend enterprise contracts for large customers.
Process image (v3/text)
Process strokes (without live updates) (v3/strokes)
Process equation image (v3/latex)
Process batch (v3/batch)*
*Note: an image with more than 12 rows of text shall count as one PDF page and will be charged at the PDF page rate.
Process PDF (v3/pdf)
Digital ink with live updates included (v3/strokes using stroke session ID)