Apify Announces the Launch of Crawlee for Python

Distributed by Globe Newswire
23rd July 2024

Prague, Czech Republic, July 23, 2024 (GLOBE NEWSWIRE) —

Apify, the world’s leading cloud platform for developing and running web scraping solutions, is excited to announce the launch of Crawlee for Python, a web scraping and browser automation library that helps users build fast and reliable crawlers.

Crawlee was created by a team of experts who scrape for a living and extract data from millions of web pages daily. Building upon the original Crawlee for Node.js, launched in 2022, Crawlee for Python offers an open-source solution that simplifies web crawler development.

“One of the main advantages of Crawlee is that the library has a single interface for both HTTP and headless browsers,” says Jan Čurn, CEO of web scraping and automation platform Apify. “You can write your crawlers using the same base abstraction, and the framework takes care of the heavy lifting such as parallelization, proxy rotation, and scaling.”

Crawlee for Python is developed and maintained by Apify. With clients including Siemens, Intercom, Microsoft, Groupon, and Accenture, Apify has become acclaimed in the industry for its innovative web scraping platform and marketplace for developers to monetize their software. Its open-source web scraping library, Crawlee, is designed to help devs build and maintain their crawlers faster.

“Developers of scrapers shouldn’t need to reinvent the wheel and can just focus on building the ‘business’ logic of their scrapers,” Čurn adds.

Some of the key features of the Crawlee for Python launch include:

Unified interface for HTTP and headless browser crawling.
HTTP: HTTPX with Beautiful Soup.
Headless browser: Users can switch their browsers from HTTP to a headless browser in 3 lines of code. Accessible with Chrome, Firefox, and other popular browsers, Crawlee builds on top of Playwright and adds its own features.
Automatic parallel crawling based on available system resources.
Written in Python with type hints to offer better DX (IDE autocompletion) and fewer bugs (static type checking).
Automatic retries on errors or when you’re getting blocked.
Integrated proxy rotation and session management.
Configurable request routing – direct URLs to appropriate handlers.
Persistent queue for URLs to crawl.
Pluggable storage of both tabular data and files.
Crawlee is built on Asyncio, so it’s fully asynchronous.

With an active Discord community of over 8,000 web scraping developers, an array of excellent benefits, and fully open source, Crawlee for Python prioritizes high-quality, readable, and maintainable code and reliable crawlers.

Apify encourages anyone interested in learning more about its Crawlee for Python announcement to try out the new web scraping and automation library today on the Crawlee website, where they can also join the Discord community.

About Apify

Founded in 2015, Apify has become renowned as the most flexible full-stack platform for web scraping and browser automation. With a commitment to making the web more programmable and automating mundane, repetitive tasks, Apify is where developers build, deploy, and publish web scraping, data extraction, and web automation tools.

More Information

To learn more about Apify and the launch of Crawlee for Python, please visit https://apify.com.

Source: https://thenewsfront.com/apify-announces-the-launch-of-crawlee-for-python/

CONTACT: Apify
Lucerna Palace
Vodickova 704/36
Prague 110 00
Czechia

https://apify.com

[email protected]

Fintech Jobs

Guavapay and Rangers Football Club Announce New Multi-year Partnership

22nd November 2024

Globe Newswire

Apify Announces the Launch of Crawlee for Python

READ NEXT

Leave a comment Cancel reply

Fintech Jobs

Related Content

Top stories

The hottest news this week

Media Packs

FinTech Futures Media Pack

FinTech Futures Sibos Media Pack

Webinar | 27 November 2024 | EMEA fintechs: unlock innovation with generative AI with AWS and NVIDIA

Webinar | 28 November 2024 | AI in financial services: Navigating the evolving regulatory landscape

Research report: Revenue enablement in financial services – 2024 global findings & insights

White paper: How AI is propelling innovation in financial services

Global survey report: Privacy in practice 2024

White paper: Cyberattacks in the financial services industry

E-book: The promise and peril of the AI revolution – managing risk

Report: It’s prime time for real-time 2024 – real-time payments adoption and growth around the globe

Banking Technology Magazine November 2024 issue out now

Upcoming events

Banking Tech Awards 2024

PayTech Awards USA 2024

Banking Tech Insights

What the FinTech? | S.5 Episode 21 | Taking open banking payments mainstream

Demystify Podcast: Demystifying legacy modernisation with Irene Sandler, CMO of Mechanical Orchard

What the FinTech? | S.5 Episode 20 | The future of fraud prevention – live at Money20/20 USA

What the FinTech? | S.5 Episode 19 | Driving growth and customer satisfaction in digital banking

Video: Experian at Money20/20 USA 2024 – Combining GenAI and rich data to drive innovation

Video: WorkWave at Money20/20 USA 2024 – Driving growth for field services companies

Video: Codat at Money20/20 USA 2024 – Innovation in B2B payments

Video: Form3 at Money20/20 USA 2024 – The evolution of instant payments

Video: DailyPay at Money20/20 USA 2024 – The growing demand for earned wage access

Sibos 2024 Content Hub – news and coverage from Beijing

Content Hub: Banking Tech Awards 2023 winners

FinTech Founders Video Series: how to build and run a start-up

OKX Launches ‘Creators Collective’: Exclusive Community Welcomes Inaugural Cohort of Onchain Builders, Curators and Artists

BlackRock Enhanced Capital and Income Fund, Inc. Approves Name and Investment Policy Changes

CoreStack Ranked Number 235 Fastest-Growing Company in North America on the 2024 Deloitte Technology Fast 500™

PatientFi Ranked 54th Fastest-Growing Company in North America on the 2024 Deloitte Technology Fast 500™

Guavapay and Rangers Football Club Announce New Multi-year Partnership

$170+ Bn Payment Gateway Global Market Opportunities and Strategies to 2033 with Amazon Payments, PayPal, Stripe, Visa, and Fiserv Dominating

data.org Launches Asia Pacific Data Capacity Accelerator

Airship Announces 2024 Altitude Award Winners