Abbyy Joins The Dots With Optical Character Recognition For Developers

Forbes, 17-04-2025
Data is for databases. It's clear that information in all its forms generally resides in some form of data store, data repository or data coalition, collection and coalescence tool so that we can go and get it when we need it. This core truth means that data is for databases and databases are for database administrators.
But data is also for developers.
Software application development engineers take a primary role in wrangling with data management in many scenarios, not least of which is the act of extracting reliable and consistent data from business documents. Because the world of enterprise business has moved (and is still moving) to a digital-first base of operations (you may have heard about so-called digital transformation, just maybe), organizations need to ingest and encode valuable documents (some digital, but still a lot of paper) so that the content they contain can form a functional part of business workflows.
But business workflows can get lumpy and suffer from disruptions. What we want are intelligent automation workflows that happen inside business operations with digital documentation from the start. This is the pain point that intelligent automation company Abbyy seeks to address directly with the launch of its ABBYY Document AI service. Stylizing its brand in capitals as it does, Abbyy's new intelligent document processing tool is accessible through what here is defined as a 'self-service' (meaning developers don't need the operations team to enable it) application programming interface.
The company says that its Document AI API was built with the developer's experience in mind. It allows software engineers to transform unstructured business documents into structured, accurate data with just a few lines of code. This functionality makes it easier to integrate and work with optical character recognition and intelligent document processing solutions.
'As a vanguard of OCR, Abbyy has long had a vibrant community of cutting-edge developers creating transformational solutions with our advanced document AI,' said Nick Hyatt, vice president, engineering R&D at Abbyy. 'We are providing them with a new API with minimal setup as well as access to ample community resources, pre-trained models for building proof-of-concepts and a predictable pay-as-you-go pricing model. Abbyy Document AI API is a major step forward for developing automated document workflows.'
According to analyst house IDC, the intelligent document processing market is projected to grow from $2.4 billion in 2023 to $10.5 billion in 2028. This 34.9% CAGR is thought to be driven by a number of factors, including increasing cloud adoption and cloud-native development, the maturation of AI services as we move out of the intelligence hype cycle into practical use cases and expanded document AI use cases in general.
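For readers who want to sanity-check a growth figure like this, the compound annual growth rate implied by two endpoint values can be worked out in a few lines. The following is a minimal sketch, not IDC's method: the function name `cagr` is illustrative, and the inputs are the rounded market sizes quoted above, so the result will not necessarily match a rate computed from unrounded data.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Return the compound annual growth rate as a fraction (0.10 == 10%)."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    # Growth compounds once per year, so take the years-th root of the ratio.
    return (end_value / start_value) ** (1 / years) - 1


# Rounded endpoints quoted in the article: $2.4bn (2023) to $10.5bn (2028).
implied_rate = cagr(2.4, 10.5, 2028 - 2023)
print(f"Implied CAGR from rounded endpoints: {implied_rate:.1%}")
```

With these rounded endpoints the implied rate comes out a little above 34%, slightly under the cited 34.9%, a gap that may simply reflect rounding in the published market sizes.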
'In the age of AI, optical character recognition is experiencing a true renaissance,' said Amy Machado, senior research manager for enterprise content and knowledge management strategies at IDC. 'Developers struggle with extracting reliable data from documents and will often begin with general large language models for this process. However, they quickly face challenges with hallucinations, data inconsistencies and errors in document processing. [They also] often lack support for multiple [human] languages, handwriting recognition and complex document structures. There is a need for purpose-built solutions specifically designed for document processing that prioritizes easy integration, flexibility, scalability, accuracy and consistency.'
Abbyy says that the Abbyy Document AI API enables software developers to enhance workflows with 'pre-trained models to extract data' from documents, which in turn naturally empowers teams to be able to accelerate automation for complex business processes like KYC (a set of guidelines and principles used mainly by financial institutions to verify and validate new clients, standing for know your customer as it does), business account openings of all forms, customs clearance, invoice processing, expense management and order processing.
Abbyy Document AI API enables quick and accurate data extraction, converting business documents of any type, format or language. According to Abbyy, this new software offering provides 'precision OCR', capable of flawlessly preserving a document's logical structure to provide AI-ready data that is essential to unlocking insights in generative AI and retrieval augmented generation. It can also help with core tasks associated with forming the robust foundation needed to train language models.
This news comes on the heels of the company establishing new AI labs across the United States, Hungary and India to accelerate the development of purpose-built AI for intelligent document processing and process automation.
'Our proprietary datasets, AI platform and model research and development combined with our deep domain knowledge create foundational intellectual property that will significantly enhance our core solutions and enable expansion into adjacent enterprise applications. We're building upon decades of leadership in OCR, machine learning, computer vision, and natural language processing while extending innovations in AI with our industry expertise. This integrated approach powers next-generation multimodal models to deliver more robust, consistent outcomes that transform business processes,' said Sanjay Nichani, vice president for AI & computer vision at Abbyy.
We hear a lot from enterprise technology vendors who tell us about their focus on customer experiences and now, more recently, the need to make great customer experiences happen by first enabling good software developer experiences. Underlining this truth, Abbyy VP Hyatt has said that, 'The developer experience is a crucial aspect of our product strategy. Our teams look forward to making next-generation ABBYY AI easier to consume with modern APIs and developer tools.'
With so much at stake in the data management arena, we clearly need to think about data for databases and database administrators, data for developers as showcased here… and the resultant new data services that consumers will be able to use when AI and automation enter their Industry 3.0 phase, which they must logically do next.