GET
/
{guru_slug}
/
crawl
/
{crawl_id}
/
status
curl --request GET \
  --url https://api.gurubase.io/api/v1/{guru_slug}/crawl/{crawl_id}/status/ \
  --header 'x-api-key: <api-key>'
{
    "id": 211,
    "url": "https://getanteon.com/",
    "status": "COMPLETED",
    "guru_type": "anteon",
    "discovered_urls": ["https://getanteon.com/features/", "https://getanteon.com/pricing/"],
    "start_time": "2025-02-21T10:25:22.710211Z",
    "end_time": "2025-02-21T10:35:45.521433Z",
    "link_limit": 1500
}

Retrieves the current status and details of a specific crawl operation. This endpoint allows you to monitor the progress of an ongoing crawl or check the results of a completed crawl.

When the crawl is completed, the discovered URLs are not indexed immediately. You need to manually add the discovered URLs as a data source by passing them to Create Data Source endpoint.

Path Parameters

guru_slug
string
required

The slug of the Guru type associated with the crawl

crawl_id
string
required

The unique identifier of the crawl operation to check

Response

id
string

Unique identifier for the crawl operation

url
string

The root URL that was crawled

status
string

Current status of the crawl operation

guru_type
string

The Guru type that the crawl was initiated for

discovered_urls
list

List of URLs discovered during crawling

start_time
string

Timestamp when crawl started (ISO 8601 format)

end_time
string

Timestamp when crawl ended (ISO 8601 format)

error_message
string

Error message if the crawl failed (only present if there was an error)

{
    "id": 211,
    "url": "https://getanteon.com/",
    "status": "COMPLETED",
    "guru_type": "anteon",
    "discovered_urls": ["https://getanteon.com/features/", "https://getanteon.com/pricing/"],
    "start_time": "2025-02-21T10:25:22.710211Z",
    "end_time": "2025-02-21T10:35:45.521433Z",
    "link_limit": 1500
}