[Bug]: CORS (Cross-Origin Resource Sharing) error when trying to use Crawl4AI to connect to Twitter. #647

alvaro562003 · 2025-02-08T15:37:20Z

alvaro562003
Feb 8, 2025

crawl4ai version

Version: 0.4.248

Expected Behavior

I am encountering a CORS (Cross-Origin Resource Sharing) error when trying to use Crawl4AI to connect to Twitter. Crawl4AI is failing to load essential scripts from Twitter's domain (abs.twimg.com), which is preventing proper connection.

Here are the console error messages I am consistently seeing in the logs:

CONSOLE]. ℹ Console: Access to script at 'https://abs.twimg.com/responsive-web/client-web/vendor.c4b9145a.js' from origin 'https://twitter.com' has been blocked by CORS policy: Response to preflight request doesn't pass access control check: No 'Access-Control-Allow-Origin' header is present on the requested resource.
[CONSOLE]. ℹ Console: Failed to load resource: net::ERR_FAILED

Current Behavior

program stops with error logs

Is this reproducible?

Yes

Inputs Causing the Bug

-URLS : https://www.x.com
NB : i used so many configurations, i prefer send the minimalist config version.

Steps to Reproduce

launch the python code on terminal  and look at the console.

Code snippets

site_url = "https://www.x.com"

import asyncio
import nest_asyncio
from crawl4ai import AsyncWebCrawler, CacheMode, BrowserConfig, CrawlerRunConfig
from crawl4ai.markdown_generation_strategy import DefaultMarkdownGenerator
import logging

# Configuration du logging
logging.basicConfig(level=logging.DEBUG, 
                   format='%(asctime)s - %(levelname)s - %(message)s')
logger = logging.getLogger('twitter_crawler')

# Apply nest_asyncio to allow nested event loops
nest_asyncio.apply()

async def main():
    # Configuration optimisée pour Twitter
    browser_conf = BrowserConfig(
        headless=False,  # Mode visible pour le debug
    )
    
    crawler_config = CrawlerRunConfig(
        cache_mode=CacheMode.DISABLED, #        cache_mode=CacheMode.BYPASS/DISABLED,
        log_console=True,  # Activation des logs console
    )

    try:
        async with AsyncWebCrawler(
            config=browser_conf,
            verbose=True,
        ) as crawler:
            result = await crawler.arun(
                url=site_url ,
                config=crawler_config
            )
            if result.success:
                logger.info("Longueur du HTML capturé: %d", len(result.html or ''))
                    
    except Exception as e:
        logger.error(f"Erreur générale: {str(e)}")
        raise e

if __name__ == "__main__":
    asyncio.run(main())

OS

window

Python version

Python 3.11.9

Browser

chromium

Browser version

ersion 133.0.6943.16 (Build officiel) (64 bits)

Error logs & Screenshots (if applicable)

Answered by aravindkarnam

Feb 10, 2025

@alvaro562003 Like @Tauvic suggested, we should add '--disable-web-security' to ignore CORS errors. You can pass this flag through extra_args key in BrowserConfig as follows:

    browser_conf = BrowserConfig(
        headless=False,
        extra_args=['--disable-web-security']
    )

Converting this to Forums, so that others may find this information easily.

View full answer

Wadehl · 2025-02-08T16:01:28Z

Wadehl
Feb 8, 2025

The same problem occurs while crawling Facebook.

0 replies

Tauvic · 2025-02-09T19:46:20Z

Tauvic
Feb 9, 2025

Checkout: microsoft/playwright#17631

0 replies

aravindkarnam · 2025-02-10T05:26:12Z

aravindkarnam
Feb 10, 2025
Collaborator

@alvaro562003 Like @Tauvic suggested, we should add '--disable-web-security' to ignore CORS errors. You can pass this flag through extra_args key in BrowserConfig as follows:

    browser_conf = BrowserConfig(
        headless=False,
        extra_args=['--disable-web-security']
    )

Converting this to Forums, so that others may find this information easily.

0 replies

alvaro562003 · 2025-02-11T07:17:34Z

alvaro562003
Feb 11, 2025
Author

Hi aravindkarnam,
thank you for your response : it works !!!! great

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: CORS (Cross-Origin Resource Sharing) error when trying to use Crawl4AI to connect to Twitter. #647

{{title}}

Replies: 4 comments

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

[Bug]: CORS (Cross-Origin Resource Sharing) error when trying to use Crawl4AI to connect to Twitter. #647

alvaro562003 Feb 8, 2025

crawl4ai version

Expected Behavior

Current Behavior

Is this reproducible?

Inputs Causing the Bug

Steps to Reproduce

Code snippets

OS

Python version

Browser

Browser version

Error logs & Screenshots (if applicable)

Replies: 4 comments

Wadehl Feb 8, 2025

Tauvic Feb 9, 2025

aravindkarnam Feb 10, 2025 Collaborator

alvaro562003 Feb 11, 2025 Author

alvaro562003
Feb 8, 2025

Wadehl
Feb 8, 2025

Tauvic
Feb 9, 2025

aravindkarnam
Feb 10, 2025
Collaborator

alvaro562003
Feb 11, 2025
Author