2025-09-16 10:35:09 [scrapy.utils.log] INFO: Scrapy 2.12.0 started (bot: zomato_check_in) 2025-09-16 10:35:09 [scrapy.utils.log] INFO: Versions: lxml 5.3.0.0, libxml2 2.12.9, cssselect 1.2.0, parsel 1.9.1, w3lib 2.2.1, Twisted 24.11.0, Python 3.10.12 (main, Aug 15 2025, 14:32:43) [GCC 11.4.0], pyOpenSSL 24.3.0 (OpenSSL 3.4.0 22 Oct 2024), cryptography 44.0.0, Platform Linux-6.8.0-1015-aws-aarch64-with-glibc2.35 2025-09-16 10:35:09 [integration_check] INFO: Dynamic attribute _job = a663ddaf92e811f0929fa729f0037d6c 2025-09-16 10:35:09 [scrapy.addons] INFO: Enabled addons: [] 2025-09-16 10:35:09 [asyncio] DEBUG: Using selector: EpollSelector 2025-09-16 10:35:09 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor 2025-09-16 10:35:09 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop 2025-09-16 10:35:09 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor 2025-09-16 10:35:09 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop 2025-09-16 10:35:09 [scrapy.extensions.telnet] INFO: Telnet Password: e572ee6bdf82e88e 2025-09-16 10:35:09 [scrapy.middleware] INFO: Enabled extensions: ['scrapy.extensions.corestats.CoreStats', 'scrapy.extensions.telnet.TelnetConsole', 'scrapy.extensions.memusage.MemoryUsage', 'scrapy.extensions.logstats.LogStats', 'scrapy_extensions.extension.BandwidthLoggerExtension', 'scrapeops_scrapy.extension.ScrapeOpsMonitor'] 2025-09-16 10:35:09 [scrapy.crawler] INFO: Overridden settings: {'BOT_NAME': 'zomato_check_in', 'FEED_EXPORT_ENCODING': 'utf-8', 'LOG_FILE': '/home/ubuntu/restaverse_spiders/logs/zomato_check_in/integration_check/a663ddaf92e811f0929fa729f0037d6c.log', 'NEWSPIDER_MODULE': 'zomato_check_in.spiders', 'SPIDER_MODULES': ['zomato_check_in.spiders'], 'TWISTED_REACTOR': 'twisted.internet.asyncioreactor.AsyncioSelectorReactor'} 2025-09-16 10:35:10 [scrapy.middleware] INFO: Enabled downloader middlewares: ['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware', 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware', 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware', 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware', 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware', 'scrapy.downloadermiddlewares.retry.RetryMiddleware', 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware', 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware', 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware', 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware', 'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware', 'scrapy.downloadermiddlewares.stats.DownloaderStats'] 2025-09-16 10:35:10 [scrapy.middleware] INFO: Enabled spider middlewares: ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware', 'scrapy.spidermiddlewares.referer.RefererMiddleware', 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware', 'scrapy.spidermiddlewares.depth.DepthMiddleware'] 2025-09-16 10:35:10 [scrapy.middleware] INFO: Enabled item pipelines: ['zomato_check_in.pipelines.ZomatoCheckInPipeline'] 2025-09-16 10:35:10 [scrapy.core.engine] INFO: Spider opened 2025-09-16 10:35:10 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min) 2025-09-16 10:35:10 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.scrapeops.io:443 2025-09-16 10:35:10 [urllib3.connectionpool] DEBUG: https://api.scrapeops.io:443 "POST /api/v1/setup/ HTTP/11" 200 None 2025-09-16 10:35:10 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6029 2025-09-16 10:35:10 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): cron.restaverse.com:443 2025-09-16 10:35:10 [urllib3.connectionpool] DEBUG: https://cron.restaverse.com:443 "POST /api/db_services/fetch-query HTTP/11" 200 3338 2025-09-16 10:35:11 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.scrapeops.io:443 2025-09-16 10:35:11 [urllib3.connectionpool] DEBUG: https://api.scrapeops.io:443 "POST /api/v1/normalizer/proxy_port/?job_id=4919331 HTTP/11" 200 None 2025-09-16 10:35:11 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.scrapeops.io:443 2025-09-16 10:35:12 [urllib3.connectionpool] DEBUG: https://api.scrapeops.io:443 "POST /api/v1/normalizer/domain/?domain=zomato.com HTTP/11" 200 241 2025-09-16 10:35:13 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/orderHistory/) 2025-09-16 10:35:14 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.scrapeops.io:443 2025-09-16 10:35:14 [urllib3.connectionpool] DEBUG: https://api.scrapeops.io:443 "POST /api/v1/normalizer/domain/?domain=restaverse.com HTTP/11" 200 245 2025-09-16 10:35:15 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/webroutes/auth/csrf) 2025-09-16 10:35:16 [integration_check] INFO: Found 6 restaurant IDs from database for client_id 297 2025-09-16 10:35:17 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=21490505) 2025-09-16 10:35:17 [integration_check] INFO: Res ID 21490505: Found emails {'tanna.vinit@zomato.com', 'dashboard@keralacafe.co', 'whitefield@keralacafe.co', 'reviews@restaverse.com', 'bhoomi.jain@zomato.com'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:17 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=21490505&business_type=o2> None 2025-09-16 10:35:17 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=18629293) 2025-09-16 10:35:17 [integration_check] INFO: Res ID 18629293: Found emails {'devdhar.bhatia@zomato.com', 'dashboard@keralacafe.co', 'baner@keralacafe.co', 'gagan.singh@zomato.com', 'sagar.rajput@zomato.com', 'reviews@restaverse.com'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:17 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=18629293&business_type=o2> None 2025-09-16 10:35:18 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=18415145) 2025-09-16 10:35:18 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=21296252) 2025-09-16 10:35:18 [integration_check] INFO: Res ID 18415145: Found emails {'gim.george@gmail.com', 'kcvaccounts@keralacafe.co', 'tanna.vinit@zomato.com', 'dashboard@keralacafe.co', 'cashierkc@gmail.com', 'reviews@restaverse.com'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:18 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=18415145&business_type=o2> None 2025-09-16 10:35:18 [integration_check] INFO: Res ID 21296252: Found emails {'devika.s@zomato.com', 'varmaasandhya@gmail.com', 'dashboard@keralacafe.co', 'reviews@restaverse.com', 'fortkochi@keralacafe.co'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:18 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=21296252&business_type=o2> None 2025-09-16 10:35:18 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=18836782) 2025-09-16 10:35:18 [integration_check] INFO: Res ID 18836782: Found emails {'kalyani@keralacafe.co', 'devdhar.bhatia@zomato.com', 'dashboard@keralacafe.co', 'gagan.singh@zomato.com', 'sagar.rajput@zomato.com', 'reviews@restaverse.com'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:18 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=18836782&business_type=o2> None 2025-09-16 10:35:18 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://www.zomato.com/partners/onlineordering/outletInfo/contactDetails?resId=20491819) 2025-09-16 10:35:19 [integration_check] INFO: Res ID 20491819: Found emails {'tanna.vinit@zomato.com', 'dashboard@keralacafe.co', 'justingeorge15@gmail.com', 'prashant.satyam32@gmail.com', 'khannabaljindersingh@gmail.com', 'namdev@keralacafe.co', 'reviews@restaverse.com', 'bhoomi.jain@zomato.com'}, target: reviews@restaverse.com, zomato_manager_access: True 2025-09-16 10:35:19 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.zomato.com/merchant-api/contacts/v2/get-all?res_id=20491819&business_type=o2> None 2025-09-16 10:35:19 [scrapy.core.engine] INFO: Closing spider (finished) 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): cron.restaverse.com:443 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: https://cron.restaverse.com:443 "POST /api/db_services/fetch-query HTTP/11" 200 289 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): cron.restaverse.com:443 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: https://cron.restaverse.com:443 "POST /insert_data_with_update/ HTTP/11" 201 91 2025-09-16 10:35:19 [integration_check] INFO: 2025-09-16 10:35:19 [integration_check] INFO: Logger Payload: {'run_id': '41b45070-727d-467d-9c49-b734bde2d4fe', 'timestamp': '2025-09-16T10:35:19Z', 'spider': 'integration_check', 'client_id': '297', 'domain': 'www.zomato.com', 'bytes_sent': 0, 'bytes_received': 42448, 'duration_seconds': 9.04, 'host': 'ip-172-31-16-168', 'status': 'finished'} 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): watchdog.restaverse.com:9200 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: http://watchdog.restaverse.com:9200 "POST /scrapy-2025-09-16/_doc HTTP/11" 201 169 2025-09-16 10:35:19 [integration_check] INFO: Logger Payload: {'run_id': '41b45070-727d-467d-9c49-b734bde2d4fe', 'timestamp': '2025-09-16T10:35:19Z', 'spider': 'integration_check', 'client_id': '297', 'domain': 'cron.restaverse.com', 'bytes_sent': 108, 'bytes_received': 133, 'duration_seconds': 9.04, 'host': 'ip-172-31-16-168', 'status': 'finished'} 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: Starting new HTTP connection (1): watchdog.restaverse.com:9200 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: http://watchdog.restaverse.com:9200 "POST /scrapy-2025-09-16/_doc HTTP/11" 201 168 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.scrapeops.io:443 2025-09-16 10:35:19 [urllib3.connectionpool] DEBUG: https://api.scrapeops.io:443 "POST /api/v1/stats/ HTTP/11" 200 108 2025-09-16 10:35:19 [scrapy.statscollectors] INFO: Dumping Scrapy stats: {'client_id': '297', 'downloader/request_bytes': 36038, 'downloader/request_count': 8, 'downloader/request_method_count/GET': 7, 'downloader/request_method_count/POST': 1, 'downloader/response_bytes': 33557, 'downloader/response_count': 8, 'downloader/response_status_count/200': 8, 'elapsed_time_seconds': 9.037953, 'finish_reason': 'finished', 'finish_time': datetime.datetime(2025, 9, 16, 10, 35, 19, 145673, tzinfo=datetime.timezone.utc), 'httpcompression/response_bytes': 42448, 'httpcompression/response_count': 7, 'item_scraped_count': 6, 'items_per_minute': None, 'log_count/DEBUG': 39, 'log_count/INFO': 20, 'memusage/max': 109289472, 'memusage/startup': 109289472, 'request_depth_max': 2, 'response_received_count': 8, 'responses_per_minute': None, 'scheduler/dequeued': 8, 'scheduler/dequeued/memory': 8, 'scheduler/enqueued': 8, 'scheduler/enqueued/memory': 8, 'start_time': datetime.datetime(2025, 9, 16, 10, 35, 10, 107720, tzinfo=datetime.timezone.utc)} 2025-09-16 10:35:19 [scrapy.core.engine] INFO: Spider closed (finished)