Submit

Web Crawler MCP Server

@JonathanHsuHH

A MCP server that provides a web crawling and content extraction tool for AI assistants
Overview

What is Web Crawler MCP Server?

Web Crawler MCP Server is a Model Context Protocol (MCP) server that provides a web crawling and content extraction tool for AI assistants like Claude Desktop and Cursor.

How to use Web Crawler MCP Server?

To use the server, install the necessary dependencies, build the server, and run it directly or configure it as an MCP server in your client.

Key features of Web Crawler MCP Server?

  • Extracts and cleans main text content from any public web page.
  • Utilizes Puppeteer with a stealth plugin to bypass anti-bot protections.
  • Returns readable, whitespace-normalized text suitable for LLM consumption.
  • Easy integration with MCP-compatible clients.

Use cases of Web Crawler MCP Server?

  1. Extracting content from news websites for AI summarization.
  2. Gathering data from public web pages for research purposes.
  3. Enabling AI assistants to access and process web content efficiently.

FAQ from Web Crawler MCP Server?

  • What are the prerequisites for using the server?

You need Node.js (v16 or higher) and an MCP-compatible client like Claude Desktop or Cursor.

  • How do I install the server?

Run npm install to install dependencies and npm run build to build the server.

  • Can I run the server in a headless mode?

Yes, but the server launches a real browser instance for best compatibility.

© 2025 MCP.so. All rights reserved.

Build with ShipAny.