Help

Parquet Field Dictionary

Both HRA and CNS dashboards share the same CloudFront log schema (40 fields). This page documents all fields.

HRA (humanatlas.io)
Parquet: data/2026-04-06_hra-logs.parquet
Date range: 2023-06-062026-04-06
Sites: Apps, Portal, KG, API, CDN, Events
CNS (cns.iu.edu)
Parquet: data/2026-04-06_cns-logs.parquet
Date range: 2008-04-132026-04-06
Sites: CNS

Top-Level Parquet Fields

FieldTypeInformationUsed For
anon_idVARCHAR · nullableAnonymous identifier for a visitor/session actor.Session grouping, retention, and repeat-visitor analytics.
dateDATE · nullableRequest date.Daily/monthly/yearly trend views and period filters.
timeVARCHAR · nullableRequest time component.Hour-of-day activity analysis.
x_edge_locationVARCHAR · nullableCloudFront edge location code that served the request.Infra routing diagnostics and coarse geo context.
sc_bytesBIGINT · nullableResponse bytes sent by server.Payload-size analysis and traffic profiling.
cs_methodVARCHAR · nullableHTTP request method (GET, POST, etc.).Request profiling and endpoint behavior checks.
cs_uri_stemVARCHAR · nullablePath part of the URL (without query params).Tool/page visit counting and route-level analytics.
sc_statusINTEGER · nullableHTTP status code returned to client.Error-rate and reliability metrics.
cs_refererVARCHAR · nullableIncoming referrer URL header.Source attribution and external ecosystem analysis.
cs_user_agentVARCHAR · nullableUser-agent string from client.Client profiling and bot/human heuristics.
cs_uri_queryVARCHAR · nullableRaw URL query string.Parameter extraction and event-context parsing.
cs_cookieVARCHAR · nullableCookie header from request.Session continuity and identity context.
x_edge_result_typeVARCHAR · nullableCloudFront result category.Infra/cache behavior diagnostics.
x_edge_request_idVARCHAR · nullableUnique CloudFront request id.Request traceability and de-dup checks.
x_host_headerVARCHAR · nullableHost header requested by client.Domain-level segmentation and host routing checks.
cs_protocolVARCHAR · nullableProtocol scheme used by client request.Transport-level diagnostics.
cs_bytesBIGINT · nullableRequest bytes sent by client.Upload/payload profiling.
time_takenDOUBLE · nullableTotal time to serve request.Latency and performance monitoring.
ssl_protocolVARCHAR · nullableTLS protocol version.Security/transport compatibility diagnostics.
ssl_cipherVARCHAR · nullableTLS cipher used for the request.Security posture and transport telemetry.
x_edge_response_result_typeVARCHAR · nullableCloudFront response result type.Delivery outcome and cache/error analysis.
cs_protocol_versionVARCHAR · nullableHTTP protocol version.Network/client compatibility analysis.
time_to_first_byteDOUBLE · nullableTime until first byte is returned.Backend/network latency monitoring.
x_edge_detailed_result_typeVARCHAR · nullableDetailed CloudFront result reason.Infra troubleshooting for specific failure classes.
sc_content_typeVARCHAR · nullableResponse content MIME type.Asset/API/document classification.
sc_content_lenBIGINT · nullableResponse content length.Payload distribution and bandwidth analysis.
sc_range_startBIGINT · nullableRange start for partial content responses.Media/file transfer diagnostics.
sc_range_endBIGINT · nullableRange end for partial content responses.Media/file transfer diagnostics.
timestampBIGINT · nullableEvent timestamp in source format.Event ordering and temporal feature engineering.
timestamp_msBIGINT · nullableEvent timestamp in milliseconds.Fine-grained sequencing and latency math.
c_countryVARCHAR · nullableCountry derived from client/edge location.Geo usage trends and country comparisons.
queryMAP(VARCHAR, VARCHAR) · nullableParsed query-parameter map.Event/tool context and parameter-level analysis.
traffic_typeVARCHAR · nullableTraffic classification label (for example likely human or bot).Filtering analytics to desired traffic segments.
referrerVARCHAR · nullableNormalized referrer category/value.Traffic-source reporting.
airportVARCHAR · nullableAirport/location code derived from edge location context.Regional infrastructure distribution analysis.
monthINTEGER · nullableMonth component derived from date.Monthly aggregation and trend charting.
dayINTEGER · nullableDay component derived from date.Daily aggregation and anomaly detection.
distributionVARCHAR · nullableDistribution/environment identifier.Comparing traffic across deployment distributions.
siteVARCHAR · nullableHigh-level site/category label for the event.Separating Apps vs Events traffic in dashboards.
yearINTEGER · nullableYear component derived from date.Year-over-year trend analysis.