# Web Sync

> Crawl and sync website content into Chroma Cloud.

Web Sync lets you sync content from any publicly accessible website into your Chroma Cloud database. Given a starting URL, Web Sync crawls the website and follows its links up to a specified depth, extracts each page's content as Markdown, chunks it, and inserts the chunks into your Chroma database with embeddings.
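
To make "crawl up to a specified depth" concrete, here is a minimal sketch of a depth-limited, breadth-first crawl. It is not Chroma's implementation: the `SITE` link graph, the `crawl` function, and the `get_links` callback are hypothetical names used only for illustration, and the "website" is an in-memory dictionary so the example runs without network access.

```python
from collections import deque

# Hypothetical in-memory "website": each URL maps to the links found on that page.
SITE = {
    "https://example.com/": ["https://example.com/docs", "https://example.com/blog"],
    "https://example.com/docs": ["https://example.com/docs/api"],
    "https://example.com/blog": [],
    "https://example.com/docs/api": ["https://example.com/docs/api/v2"],
    "https://example.com/docs/api/v2": [],
}


def crawl(start_url, max_depth, get_links):
    """Breadth-first crawl that follows links at most max_depth hops from start_url."""
    seen = {start_url}
    order = []
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        order.append(url)
        if depth == max_depth:
            continue  # page is visited, but its links are beyond the depth limit
        for link in get_links(url):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return order


pages = crawl("https://example.com/", max_depth=1, get_links=lambda u: SITE.get(u, []))
print(pages)
```

With `max_depth=1`, only the starting page and the pages it links to directly are visited. In a real crawler, `get_links` would fetch the page and extract its links, and each visited page would then be converted to Markdown, chunked, and embedded.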

# Walkthrough

If you don't have a Chroma Cloud account yet, create one at [trychroma.com](https://www.trychroma.com). After creating an account, create a database by giving it a name:

<img src="https://mintcdn.com/chroma-8943dec5/N-xA4EbmHOvIcCcs/images/sync/sync_web_new_db.png?fit=max&auto=format&n=N-xA4EbmHOvIcCcs&q=85&s=a95b9721659ae44ff9348e38c6b25003" alt="Create database screen" width="2536" height="1928" data-path="images/sync/sync_web_new_db.png" />

Then, select the Web source during onboarding:

<img src="https://mintcdn.com/chroma-8943dec5/N-xA4EbmHOvIcCcs/images/sync/sync_web_onboarding.png?fit=max&auto=format&n=N-xA4EbmHOvIcCcs&q=85&s=a3346a4ef4a8ce323d029097e717ab71" alt="Onboarding screen" width="2536" height="1928" data-path="images/sync/sync_web_onboarding.png" />

Next, configure the Web source by providing a starting URL:

<img src="https://mintcdn.com/chroma-8943dec5/N-xA4EbmHOvIcCcs/images/sync/sync_web_url_config.png?fit=max&auto=format&n=N-xA4EbmHOvIcCcs&q=85&s=b1f9167131b02bd843f1fdfc2604246e" alt="Web source config" width="2536" height="1928" data-path="images/sync/sync_web_url_config.png" />

You can also configure other parameters, such as the page limit and include path regexes. Here, we're scraping at most 50 pages under `https://docs.trychroma.com/cloud` (all of our cloud docs):

<img src="https://mintcdn.com/chroma-8943dec5/N-xA4EbmHOvIcCcs/images/sync/sync_web_advanced_config.png?fit=max&auto=format&n=N-xA4EbmHOvIcCcs&q=85&s=a9a61673fbf9703ff59b6f3096eba02d" alt="Web source advanced config" width="2536" height="1928" data-path="images/sync/sync_web_advanced_config.png" />
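
As a rough mental model of how a page limit and include path regexes might combine, the sketch below filters a list of discovered URLs. It rests on several assumptions that the product may not share: patterns are matched against the URL path (not the full URL) with `re.search`, an empty pattern list includes everything, and the limit counts only pages that pass the filter. The `select_pages` function and the sample URLs are hypothetical.

```python
import re
from urllib.parse import urlparse


def select_pages(urls, include_path_patterns, page_limit):
    """Keep URLs whose path matches any include pattern, stopping at page_limit."""
    compiled = [re.compile(p) for p in include_path_patterns]
    selected = []
    for url in urls:
        if len(selected) >= page_limit:
            break  # page limit reached; ignore the remaining URLs
        path = urlparse(url).path
        # Assumption: no patterns configured means every path is included.
        if not compiled or any(rx.search(path) for rx in compiled):
            selected.append(url)
    return selected


# Made-up URLs for illustration; only the /cloud prefix comes from the walkthrough.
discovered = [
    "https://docs.trychroma.com/cloud/page-a",
    "https://docs.trychroma.com/other/page-b",
    "https://docs.trychroma.com/cloud/page-c",
    "https://docs.trychroma.com/cloud/page-d",
]
kept = select_pages(discovered, include_path_patterns=[r"^/cloud"], page_limit=50)
print(kept)
```

Anchoring the pattern with `^/cloud` restricts the crawl to paths that start with `/cloud`; an unanchored `cloud` would also match a path such as `/blog/cloud-launch`.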

You can also change the default collection name. After you click "Create Sync Source", an initial sync starts:

<img src="https://mintcdn.com/chroma-8943dec5/N-xA4EbmHOvIcCcs/images/sync/sync_web_progress.png?fit=max&auto=format&n=N-xA4EbmHOvIcCcs&q=85&s=9b137c480f6263447fa5ae2e18834250" alt="Web sync in progress" width="2536" height="1928" data-path="images/sync/sync_web_progress.png" />

After it finishes, you'll be redirected to the created collection.
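
Once the collection exists, you can query it like any other Chroma collection. The helper below is a small sketch, not an official recipe: `search_collection` is a hypothetical name, the collection name in the comments is a placeholder, and it assumes the collection's configured embedding function can embed the query text (otherwise you would pass `query_embeddings` instead).

```python
def search_collection(collection, question, n_results=3):
    """Return (id, document) pairs for the chunks closest to the question."""
    results = collection.query(query_texts=[question], n_results=n_results)
    # query() returns one result list per query text; a single query was sent.
    return list(zip(results["ids"][0], results["documents"][0]))


# Typical wiring (requires `pip install chromadb` and real credentials):
#
#   import chromadb
#   client = chromadb.CloudClient(api_key="...", tenant="...", database="...")
#   collection = client.get_collection(name="my-web-sync-collection")  # placeholder name
#   for chunk_id, text in search_collection(collection, "How do I create a database?"):
#       print(chunk_id, text[:80])
```

Keeping the query logic in a function that accepts any collection object makes it easy to try against a stub before pointing it at your Chroma Cloud database.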
