Exploring Web analytics live traffic
The Web analytics Live tab (/web/live) shows real-time activity over a 30-minute sliding
window plus a 60-second "users online" count. It is the place to answer "what is happening
on my site right now?" — pageviews, named bots, devices, geo, top paths, top referrers, and
a live event feed.
This skill teaches you (the agent) how to:
- recognize a request that belongs on the Live tab
- read the tile model (what each card shows, where the data comes from)
- manipulate the only filter that exists (host)
- build product-analytics insights that match a Live tile when the user wants longer time ranges or deeper drill-down than the live window offers
The Live tab is not a HogQL playground — its data comes from a livestream backed by short HogQL backfills. When the user wants to query "right now" data with HogQL, point them at the tab; when they want historical breakdowns, build an insight with the patterns below.
When to use this skill
Use this skill when the user:
- asks "who is on my site right now?", "what is happening live?", "show me live traffic"
- mentions the "Live" tab, the "Live dashboard", or the live page (
/web/live) - asks about live bot traffic ("which bots are crawling me?", "is GPTBot scraping us?")
- wants to filter live traffic by domain / host
- wants to compare what they see on the Live tab to a longer time window — e.g. "the live tab shows GPTBot is hammering us, can you give me a 7-day chart of that?"
Do not use this skill for non-realtime web analytics work — for that, use the standard
Web analytics tab (/web).
Tab structure
URL: /web/live
The tab has two filter affordances and a grid of tiles. Date range is fixed: 30 minutes sliding window for everything except "Users online" (last 60 seconds).
Filters
There is only one filter on the live tab: the host (domain) selector.
- It comes from
webAnalyticsFilterLogic.selectedHost. - It is shared with the rest of Web analytics, so changing it on
/webpropagates to/web/liveand vice-versa. - It is gated by feature flag
WEB_ANALYTICS_LIVE_DOMAIN_FILTER. If the flag is off, no host filter UI is rendered and all tiles show data across every domain. - Setting the host filter narrows: the SSE stream, the HogQL backfill queries (so the initial 30 min is host-scoped), and the "users online" count.
- There is no date picker, no compare control, no property filters, no test-account filter on the Live tab. Do not promise the user controls that don't exist.
When the user asks "filter live traffic by domain <host>", direct them to the Domain
selector at the top of the Live tab. There is no URL param to set it directly — it
persists in localStorage via webAnalyticsFilterLogic.
Stat cards (top strip)
| Card | What | Window |
|---|---|---|
| Users online | Distinct device IDs seen in the last 60 seconds | 60s |
| Unique visitors | Distinct device IDs in the last 30 min | 30m |
| Pageviews | $pageview count in the last 30 min |
30m |
Content cards
| Card | What | Notes |
|---|---|---|
| Active users per minute | Bar chart, new vs returning visitors | last 30 min |
| Top pages | Animated leaderboard, $pathname + view count |
top 10, 30 min |
| Top referrers | Animated leaderboard, $referring_domain |
top 10, 30 min |
| Devices | Breakdown bars, $device_type |
top 6 + Other |
| Browsers | Breakdown bars with logos, $browser |
top 6 + Other |
| Top countries | Breakdown bars, $geoip_country_code |
top 6 + Other; replaced by a Country/City tab card if WEB_ANALYTICS_LIVE_CITY_BREAKDOWN is on |
| Bot requests per minute | Bar chart, bot events / minute | flag WEB_ANALYTICS_BOT_ANALYSIS |
| Bot traffic | Named bots ranked by event share, with category tag | flag WEB_ANALYTICS_BOT_ANALYSIS; rows are clickable and open an insight for that specific bot |
| Countries (world map) | SVG world map heat | flag WEB_ANALYTICS_LIVE_MAP |
| Live events | Streamed event feed (event, person, URL, timestamp) | last 50 events |
Every tile (except the live event feed and world map) has an "Open as new insight" button that opens a 7-day Trends query in product analytics. The bot traffic tile rows are also individually clickable — clicking a bot row opens a single-bot trend.
Bot detection model
Bots are detected server-side. Three virtual properties are attached to the event before it lands in ClickHouse:
$virt_is_bot— boolean,trueif classified as a bot$virt_bot_name— string, the bot's display name (e.g.Googlebot,GPTBot,Claude,Lighthouse,HeadlessChrome)$virt_traffic_category— string, the category key:ai_crawler,ai_search,ai_assistant,search_crawler,seo_crawler,social_crawler,monitoring,http_client,headless_browser,no_user_agent,regular
The Live bot tiles count "bot-eligible" events: $pageview, $pageleave, $screen,
$http_log, $autocapture. $http_log is included because most bots emit server-side
HTTP logs rather than JS pageviews.
Building product-analytics queries that mirror the Live tab
When the user wants a longer window, a saved insight, a dashboard tile, or to share a view of what's on the Live tab, build a Trends insight. The "Open as new insight" buttons in the UI use exactly these recipes:
Bot traffic breakdown (matches the bot tile header)
A single chart of all bots over time, broken down by name. This is the canonical "who's crawling me?" view.
{
"kind": "TrendsQuery",
"interval": "hour",
"dateRange": { "date_from": "-7d" },
"series": [
{
"kind": "GroupNode",
"custom_name": "Requests",
"operator": "OR",
"math": "total",
"nodes": [
{ "kind": "EventsNode", "event": "$pageview", "math": "total" },
{ "kind": "EventsNode", "event": "$pageleave", "math": "total" },
{ "kind": "EventsNode", "event": "$screen", "math": "total" },
{ "kind": "EventsNode", "event": "$http_log", "math": "total" },
{ "kind": "EventsNode", "event": "$autocapture", "math": "total" }
]
}
],
"properties": [{ "key": "$virt_is_bot", "value": ["true"], "operator": "exact", "type": "event" }],
"breakdownFilter": {
"breakdown": "$virt_bot_name",
"breakdown_type": "event",
"breakdown_limit": 25
},
"trendsFilter": { "display": "ActionsBarValue" }
}
Single bot drill-down (matches a clicked bot row)
{
"kind": "TrendsQuery",
"interval": "hour",
"dateRange": { "date_from": "-7d" },
"series": [
/* same combined "Requests" GroupNode as above */
],
"properties": [
{ "key": "$virt_is_bot", "value": ["true"], "operator": "exact", "type": "event" },
{ "key": "$virt_bot_name", "value": ["GPTBot"], "operator": "exact", "type": "event" },
{ "key": "$virt_traffic_category", "value": ["ai_crawler"], "operator": "exact", "type": "event" }
],
"trendsFilter": { "display": "ActionsLineGraph" }
}
The category filter is optional — include it when the user asks about a specific
bot+category combo (Lighthouse · headless_browser is a different signal from
Lighthouse · monitoring).
Bot category breakdown (matches the bot events chart tile)
Use breakdown by $virt_traffic_category instead of $virt_bot_name when the user
wants "AI crawlers vs SEO crawlers vs everything else" rather than per-bot rows.
Top pages / referrers / devices / browsers / countries
For non-bot tiles, use $pageview with math: unique_users, breakdown by the
underlying property:
| Tile | breakdown property | display |
|---|---|---|
| Top pages | $pathname |
ActionsBarValue |
| Top referrers | $referring_domain |
ActionsBarValue |
| Devices | $device_type |
ActionsPie |
| Browsers | $browser |
ActionsPie |
| Countries | $geoip_country_code |
WorldMap |
Always inherit the live tab's host filter when the user is asking about a specific
domain — add { "key": "$host", "value": ["<host>"], "operator": "exact", "type": "event" }
to properties.
Defaults to use
dateRange.date_from:-7dunless the user names a window — the live view itself is 30 min, but the user is almost always asking about a longer window when they request an insight version.interval:hourfor 7-day windows,minuteonly for windows under a day,dayfor windows beyond 14 days.- Always inherit the host filter when one is set on the Live tab. Don't drop it silently — that changes the answer.
Common requests and the right move
| User says | Right move |
|---|---|
| "What's happening on the site right now?" | Send them to /web/live |
"Filter live traffic to example.com" |
Use the Domain selector at top of /web/live |
| "Show me bots crawling us in the last 30 min" | /web/live → Bot traffic tile |
| "Show me bots crawling us this week" | Build the "Bot traffic breakdown" insight above with date_from: -7d |
| "How much is GPTBot hitting us?" | Build the "Single bot drill-down" insight, set $virt_bot_name to GPTBot |
| "Why is the live tab showing X but my dashboard shows Y?" | The live tab is a 30-min sliding window over events; dashboards aggregate over the picked range. They are not directly comparable beyond the last 30 min. |
| "Add a date range to the live tab" | The Live tab has no date picker — for ranges, build a Trends insight using the patterns above |
| "Filter live traffic by browser / device / country" | Not supported — only the host filter exists. Build a Trends insight with the relevant breakdown + filter instead |
Gotchas
- Bot virtual properties (
$virt_*) only exist on events processed by the bot classification step. They are not retroactive — events from before the classifier shipped will not have them. KeepdateRange.date_fromwithin the last few months for reliable bot results. $http_logevents come from server-side log capture, not fromposthog-js. If a project does not emit$http_log, bots that don't run JS (most crawlers) will be invisible to the bot tiles.- The 30-minute window is a sliding aggregation over an in-memory buffer in the browser — refreshing the page replays the backfill HogQL, not the SSE stream. Do not interpret a brief "0" right after page load as a real drop.
- The host filter strips the protocol — pass
example.com, nothttps://example.com. - Tile order is persisted per-team in
localStorage(under feature flagWEB_ANALYTICS_LIVE_EDIT_LAYOUT). If a user's layout looks different from yours, it is not a bug.