
Amazon Textract particularly differentiated itself when processing menus with unique fonts, background images, and low image resolutions. Restaurants often get creative when designing their menus, so OCR robustness was crucial for this use case. To improve system performance, we fine-tuned these models by generating a dataset of 1.5 million synthetic text images that were more representative of text in menus.Īfter evaluating our in-house solution and several commercial OCR solutions, we found that Amazon Textract offers the best text recognition precision and recall.

The extracted email addresses and phone numbers. It is designed to extract email address and phone numbers with various criteria & various options to give best results.
#Zomato data extractor software#
This extractor is the FASTEST Software available on internet. Extracts all email addresses and phone numbers from source website.

The challenge with these models was that they were trained on a standard text dataset that didn’t match the eclectic fonts found in restaurant menus. Zomato Data Extractor tool can intelligently collect bulk data for digital marketing purpose. We first created an in-house OCR solution by stacking a pre-trained text detection model and a pre-trained text recognition model. For our use case, we experimented with both in-house and commercial OCR solutions. This process is known as optical character recognition (OCR). and restaurant choices of customers in a city by scraping data from Zomato. The first component of this solution was to accurately extract all the text in the menu image. This repository helps you extract data from the official zomato website by. Extracting raw text from menus with Amazon Textract This post summarizes how we used Amazon Textract and Amazon SageMaker to develop a customized menu digitization solution. To develop this menu digitization technology, we partnered with Amazon ML Solutions Lab to explore the capabilities of the AWS ML Stack. We power this functionality with machine learning (ML), using it to extract and structure text data from menu images. These capabilities enable us to recommend restaurants to zomato users based on searches for specific dishes. zomato is a global food-tech company based in India.Īre you the kind of person who has very specific cravings? Maybe when the mood hits, you don’t want just any kind of Indian food-you want Chicken Chettinad with a side of paratha, and nothing else will hit the spot! To help picky eaters satisfy their cravings, we at zomato have recently added enhanced search engine capabilities to our restaurant aggregation and food delivery platform.

#Zomato data extractor update#

It is designed to extract phone numbers with various categories & keywords.User can add multiple location and multiple keywords at same time.This extractor is the FASTEST Software available on internet.Extracts all phone numbers from source site.User can get millions of realtime & fresh data from source site.Zomato Lead Extractor is a tool that collect contact information such as Name, Location, Complete Address, Mobile & WhatsApp Number, Ratings and Reviews, Location co-ordinates, Website URL and other important information from Zomato Website.Zomato Lead Extractor software can intelligently collect bulk data from source site. It is one of the fastest software available in the market.
#Zomato data extractor manual#
Extract premium unlimited business / contacts from public listings on Zomato website with no manual effort.
