Web User Session Reconstruction Using Integer Programming
Dell, Robert F.
Román, Pablo E.
Velásquez, Juan D.
MetadataShow full item record
An important input for web usage mining is web user sessions that must be reconstructed from web logs (sessionization) when such sessions are not otherwise identified. We present a novel approach for sessionization based on an in- teger program. We compare results of our approach with the timeout heuristic on web logs from an academic web site. We find our integer program provides sessions that better match an expected empirical distribution with about half of the standard error of the heuristic.