InLevel Up CodingbyJúlio AlmeidaClaude 3.5 — The King of Document IntelligenceAchieving Near-Perfect Document Intelligence with Claude 3.5 Sonnet and Haiku. Classification, Splitting, and ExtractionOct 29, 202415Oct 29, 202415
InTowards AIbyTapan BabbarEnhance OCR with Llama 3.2-Vision using OllamaThis project upgrades book cover recognition by using Llama 3.2-Vision to seamlessly extract and complete titles and author names from…Oct 27, 20247Oct 27, 20247
Tapan BabbarHow to Run Llama 3.2-Vision Locally With Ollama: A Game Changer for Edge AIA quick guide to running llama 3.2-vision locally using Ollama with a hands-on demo.Oct 22, 20249Oct 22, 20249
Brain TitanGOT-OCR2.0: The Future of Optical Character RecognitionDiscover GOT-OCR2.0, a groundbreaking AI model that transforms complex OCR tasks. Learn how it outperforms traditional systems and enhances…Sep 20, 20244Sep 20, 20244
InOpenSourceScribesbyC. L. Beard11 Trending Github ProjectsGrowing GitHub Projects You should watchJul 14, 2024Jul 14, 2024
Dr James RavenscroftReviewing the Best Paid and Open Source AI-Powered Handwriting OCRRemarkably (for me), this essay started its life as a scribble in a notebook rather than something I typed into a markdown editor!Apr 2, 20241Apr 2, 20241
Enrico RandelliniExploring the Microsoft Phi3 Vision Language model as OCR for document data extraction-part 2…Applications of computer vision techniques as SIFT, blur removal, brightness and contrast adjustment, to clean personal document before…Jun 26, 2024Jun 26, 2024
Simeon EmanuilovmPLUG-DocOwl 1.5: A leap forward in OCR-Free document understandingIn a recent breakthrough, the AI research community has taken a significant stride towards more efficient and comprehensive document…Mar 24, 20241Mar 24, 20241