Machine Learning View

What the Models Suggest (Plain Language)

This page translates model output into practical answers: expected traffic, unusual surges, who returns, where people go next, recurring error themes, and countries with unusual traffic patterns.

How To Read This Page

Forecasts: directional estimates, not exact counts.
Behavior charts: percentages are easier to trust than raw scores.
Anomalies: these are review candidates, not confirmed issues.
Recommendations: treat as hints for UI prompts and cross-links.
Sessions Analyzed12,591170,685 tracked events
Return Model Score64/10050 = random guess, 100 = perfect
Unusual Traffic Surges14KG Explorer has the biggest jump
Error Themes444,660 error rows clustered
Countries To Review125 look likely automated

Expected Visits Over The Next 6 Months

Each line is a tool. Hover to see likely low/high range.

Forecast
Biggest projected month in this run: KG Explorer at 6,857 visits in Aug 2026 (likely range 3,556 - 10,087).

Big Traffic Jumps We Should Explain

Bars show extra visits compared with the previous month.

Surges

Largest jump in this run: KG Explorer in Jul 2025.

Who Comes Back After A Session?

Blue bars = number of sessions. Orange line = real return rate.

Return Behavior
Strongest return signal: early page views · strongest drop-off signal: early events.

Cohort Retention — Do Users Come Back?

Each line is a monthly cohort. Drops show how retention decays over time.

Cohort Analysis

Hollow circles = Apr '26 data (partial month, undercounts real retention).

Reading the chart: Row = cohort month, Column = months after first visit. Darker blue = more users returned. Tracked via persistent cookie (anon_id).
Month 0 = 100% by definition — these are the users that define the cohort. Watch for how quickly each row fades.
Cookie tracking started Oct 2025, so only recent cohorts appear. Older visits lack persistent IDs for user-level retention measurement.

Session Types (Simple Grouping)

The model groups sessions by depth, duration, and interaction style.

Audience Types

Largest group: Regular Researchers (66.9%).

Where People Go Next Between Tools

Rows are current tool, columns are next tool in the same session.

User Journeys

Most common next-step path: KG ExplorerCDE (39.4%, 28 transitions).

Behaviors That Happen Together

How often the right-side action happens after the left-side action.

Behavior Pairings

Strongest pairing right now: organ selection + KG Explorer -> download export (confidence 53.1%).

Main Error Themes

~55% are uncontrollable network failures · ~25% are fixable KG Explorer icon bugs · ~3% is dev noise

Errors

Largest cluster contributes 49.9% of error rows.

Countries With Unusual Traffic Patterns

Bar = bot-like traffic share. Use as a review list, not a final verdict.

Geo Review

Highest-priority review country in this run: Finland.

Confidence Note

Return model score is 64/100, which is useful for ranking risk but not perfect for exact individual prediction.

Bot Signal Driver

Most influential bot feature is user-agent string length in this run.

Journey Coverage

121 sessions contained multi-step tool journeys we could model.

Data Coverage

Last ML run processed 121 monthly tool points and 12,591 session-level transactions.