InLevel Up CodingbyJúlio AlmeidaClaude 3.5 — The King of Document IntelligenceAchieving Near-Perfect Document Intelligence with Claude 3.5 Sonnet and Haiku. Classification, Splitting, and ExtractionOct 2915Oct 2915
InTowards AIbyTapan BabbarEnhance OCR with Llama 3.2-Vision using OllamaThis project upgrades book cover recognition by using Llama 3.2-Vision to seamlessly extract and complete titles and author names from…Oct 275Oct 275
Tapan BabbarHow to Run Llama 3.2-Vision Locally With Ollama: A Game Changer for Edge AIA quick guide to running llama 3.2-vision locally using Ollama with a hands-on demo.Oct 224Oct 224
Brain TitanGOT-OCR2.0: The Future of Optical Character RecognitionDiscover GOT-OCR2.0, a groundbreaking AI model that transforms complex OCR tasks. Learn how it outperforms traditional systems and enhances…Sep 204Sep 204
InOpenSourceScribesbyC. L. Beard11 Trending Github ProjectsGrowing GitHub Projects You should watchJul 14Jul 14
Dr James RavenscroftReviewing the Best Paid and Open Source AI-Powered Handwriting OCRRemarkably (for me), this essay started its life as a scribble in a notebook rather than something I typed into a markdown editor!Apr 21Apr 21
Enrico RandelliniExploring the Microsoft Phi3 Vision Language model as OCR for document data extraction-part 2…Applications of computer vision techniques as SIFT, blur removal, brightness and contrast adjustment, to clean personal document before…Jun 26Jun 26
Simeon EmanuilovmPLUG-DocOwl 1.5: A leap forward in OCR-Free document understandingIn a recent breakthrough, the AI research community has taken a significant stride towards more efficient and comprehensive document…Mar 241Mar 241