What Is Behavioral Analytics? UEBA for Detection
Behavioral analytics models normal activity for each user and entity, then scores deviations from that baseline as potential security risk.
A signature catches what someone has already seen. A stolen credential used by an attacker who logs in during business hours, from a normal-looking endpoint, with valid Kerberos tickets, trips none of them. There is no malware to hash, no exploit to fingerprint, no known-bad domain to block. The session looks like the user, because it is using the user's account. What gives it away is not what it is, but what it does: a service account that suddenly enumerates the domain, a finance analyst who pulls ten thousand records at 2 a.m., a workstation that starts talking SMB to hosts it has never touched. That gap, between identity and behavior, is what behavioral analytics is built to close.
Behavioral analytics in security is the practice of modeling what normal activity looks like for each user and entity, then detecting and scoring the activity that deviates from it. The packaged form most defenders meet is User and Entity Behavior Analytics (UEBA), a category Gartner introduced around 2015. This guide covers what behavioral analytics actually is, how baselining works step by step, how it differs from signature-based detection, the telemetry it runs on, the use cases it earns its keep on, the role of machine learning and false-positive tuning, and where it sits inside a SIEM and a SOC. It is written for the people who triage the alerts it produces and reconstruct the incidents it catches.
What is behavioral analytics in security?
Behavioral analytics is the detection approach that learns a baseline of normal behavior for an entity, then treats statistically significant departures from that baseline as signals worth investigating. The entity is usually a user, but it can be a service account, a host, a server, an application, or an IP address, anything whose activity has a pattern worth learning. The output is rarely a flat allow or deny. It is a risk score that rises as an entity's behavior diverges from its own history and from its peers.
The "and entity" in UEBA matters. Early User Behavior Analytics (UBA) watched humans. But a compromised service account or a beaconing host has no human behind it, and those are exactly the things attackers abuse after an initial foothold. Adding entities to the model means the same machinery that flags a user logging in from a new country also flags a database server that starts initiating outbound connections it never made before.
The premise is simple and the consequence is large: you do not need to know what the attack looks like in advance. Signature detection answers "have I seen this exact bad thing before?" Behavioral analytics answers "is this entity acting unlike itself?" That reframing is what lets it catch insider misuse, account takeover, and novel tradecraft that no rule was written for.
How baselining works: collect, model, score
Behavioral analytics runs as a loop with three moving parts: collect telemetry, model normal, score deviations. Each one is where it succeeds or fails.
Collect telemetry. The system ingests activity data across the environment: authentication events, endpoint process and file activity, network flows, cloud API calls, data-access logs. The richer and cleaner the telemetry, the better the model. Garbage in, garbage out applies literally here; a baseline built on partial or mistimed logs produces a model of a fiction.
Model normal. Over a learning window, typically days to weeks, the system builds a profile per entity. What hours does this user work? Which systems do they touch? How much data do they normally move? It also builds peer-group profiles: what does "normal" look like for the finance team, or for domain controllers as a class? Peer comparison is what catches the entity that looks fine against a generic threshold but is wildly out of step with its own cohort.
Score deviations. Once a baseline exists, new activity is compared against it. A deviation is not an automatic alert. Each anomaly contributes to a risk score, and scores accumulate. One off-hours login is noise. An off-hours login, followed by access to a system the user has never touched, followed by a large download, is a rising score that crosses a threshold and surfaces as an alert. Scoring the chain rather than the single event is what keeps the approach usable.
The loop never really ends. Baselines drift as people change roles, teams adopt new tools, and business cycles shift. A model that is not retrained ages into either silence or noise.
Behavioral analytics vs signature-based detection
The two are complementary, not competing, but they answer different questions and fail in different ways. Signature-based detection, the engine behind antivirus, most intrusion detection system rules, and IOC matching, compares activity against a library of known-bad patterns: a file hash, a byte sequence, a domain, a rule like "process X spawning process Y." It is fast, precise, and cheap to explain. Its limit is built into its design: it can only catch what someone has already catalogued. A novel technique, a living-off-the-land attack using only built-in tools, or a legitimate credential in the wrong hands has no signature to match.
Behavioral analytics inverts the question. Instead of a library of bad, it carries a model of normal, and flags the deviation. That is why it catches the things signatures miss: the insider who is authorized for everything they touch, the attacker who brought no malware, the account takeover that uses valid credentials. The cost is the inverse weakness. Signatures produce few false positives but miss the unknown; behavioral analytics catches the unknown but produces more false positives, because "unusual" and "malicious" are not the same thing.
| Dimension | Signature-based detection | Behavioral analytics (UEBA) |
|---|---|---|
| Core question | Have I seen this exact bad thing before? | Is this entity acting unlike itself? |
| Detection basis | Known patterns: hashes, IOCs, rules | Deviation from a learned baseline |
| Catches | Known malware, known attacks | Insider misuse, account takeover, novel tradecraft |
| Misses | Novel and signatureless attacks | Malice that looks statistically normal |
| Output | Binary match or no match | Risk score that accumulates |
| False positives | Low | Higher, needs tuning |
| Setup | Deploy a ruleset | Learning period to build baselines |
| Explainability | High, the rule names the reason | Lower, "anomalous" needs context |
Mature detection programs run both. Signatures handle the known and the cheap; behavioral analytics covers the gap signatures leave. Neither one alone is a detection strategy.
The telemetry behavioral analytics runs on
A behavioral model is only as good as the data feeding it. The standard sources, and what each one reveals:
- Authentication and identity logs. Active Directory, Kerberos, SSO, and VPN events. These carry the signals for account takeover: impossible-travel logins, first-time access to a system, authentication spikes, privilege use that does not fit the account.
- Endpoint (EDR). Process execution, file access, command-line activity, parent-child process trees. This is where you see a user account suddenly running reconnaissance tooling or a service account spawning a shell.
- Network. Flow data, internal east-west traffic, DNS, and proxy logs. Network behavior exposes lateral movement and beaconing: a host that starts connecting to peers it never spoke to, or steady outbound to an unfamiliar destination.
- Cloud and SaaS. Provider API calls, IAM changes, storage access, admin actions. As workloads move to the cloud, this telemetry catches the cloud-native version of every on-prem pattern: a key used from a new region, a sudden mass-download from object storage, a permission grant that does not fit the role.
- Data access. File-share, database, and DLP logs. These feed the exfiltration use case directly: who read what, how much, and how that compares to their normal volume.
The pattern across all five: behavioral analytics correlates signals that look benign in isolation. An off-hours login is fine. A large download is fine. A new internal connection is fine. The same three on the same account inside an hour are a story.
Use cases that earn their keep
Four use cases are where behavioral analytics consistently pays off, and they are the canonical ones in UEBA's design.
Insider threat. The hardest case for signature detection, because the insider is authorized. A behavioral model catches the deviation from the person's own norm: the departing employee who suddenly archives whole project directories, the admin who accesses HR records outside any ticket. Insiders are a real share of incidents. Verizon's 2026 Data Breach Investigations Report attributes 12% of breaches to internal actors, down from 18% the prior year, and notes the most common motive for insider misuse was convenience, followed by financial gain.
Compromised account / account takeover. The credential is valid, so authentication succeeds. What does not match is the behavior behind the login: a new geography, a new device, access patterns that do not fit the user. Behavioral analytics is often the only layer that distinguishes the real user from an attacker wearing their login.
Lateral movement. After an initial foothold, attackers move host to host using stolen credentials and legitimate protocols. Each step can look like normal administration. The behavioral signal is the pattern: an account or host touching systems outside its baseline, in a sequence no legitimate workflow produces. This maps directly to MITRE ATT&CK's Lateral Movement tactic (TA0008).
Data exfiltration. The final-stage signal is volume and destination against a baseline: a user or service moving far more data than usual, to a place they normally do not. This maps to ATT&CK's Exfiltration tactic (TA0010), and data-access telemetry is what makes it visible before the data is gone.
Machine learning and the false-positive problem
Behavioral analytics leans on machine learning because the scale and dimensionality defeat static rules. You cannot hand-write a threshold for every user, every entity, and every behavior, and keep it current. So the models do the profiling: unsupervised methods cluster normal activity and surface outliers without being told what to look for, supervised models learn from labelled incidents, and peer-group analysis benchmarks an entity against its cohort rather than a global average. This is the line between true UEBA and rule-only correlation, and it is also why behavioral analytics is one of the practical homes for AI anomaly detection in a SOC.
The catch is the one every analyst already knows: unusual is not the same as malicious. People take new jobs, projects spike, someone logs in from a conference. Each is a deviation, and a naive model alerts on all of them. An untuned UEBA deployment buries a SOC in noise and trains analysts to ignore it, which is worse than no detection at all.
Tuning is the work that makes it usable. Risk scoring that accumulates across a chain rather than firing on a single anomaly is the first lever: require several correlated deviations before an alert surfaces. Peer grouping cuts the false positives that a global threshold creates. Feedback loops matter too: when an analyst dispositions an alert as benign, that judgment should refine the model. A behavioral analytics platform that does not get quieter as the SOC trains it is not earning its place in the queue.
Where it fits in a SIEM and a SOC
Behavioral analytics began as a standalone product category. It largely is not one anymore. Gartner retired the standalone UEBA market guidance and now evaluates UEBA as a core capability inside the SIEM, and most analysts encounter it as a feature of their SIEM or XDR rather than a separate console. The reason is data gravity: behavioral analytics needs broad telemetry, and the SIEM is already where that telemetry lands.
In practice the SIEM does the collection and correlation, the behavioral engine builds baselines and produces risk scores on top of that data, and the scores feed back into the SIEM as prioritized alerts and into threat monitoring workflows. A high entity-risk score becomes a triage case: the analyst pivots from the score to the underlying events, the authentication trail, the process tree, the data accessed, and decides. Behavioral analytics does not replace the analyst. It changes what reaches them, from a flood of low-context rule hits to a ranked queue of entities behaving abnormally.
That changes the investigative artifact too. A behavioral alert arrives with a story already assembled: this entity, this baseline, these deviations, this accumulated score. For a responder, that is a head start, the same correlation work that used to happen by hand after the alert, done before it.
The bottom line
Behavioral analytics closes the gap between identity and behavior. Signatures catch what has been seen before; behavioral analytics catches an entity acting unlike itself, which is how insiders, stolen credentials, and signatureless attacks actually present. It runs as a loop: collect telemetry across auth, endpoint, network, cloud, and data access; model what normal looks like for each user and entity and its peer group; score the deviations and accumulate the chain into a risk number that crosses a threshold.
The payoff is a ranked queue of entities worth investigating instead of a flood of low-context rule hits, with the correlation work done before the alert reaches an analyst. The cost is tuning, because unusual is not malicious, and an untrained model is just noise. Run it alongside signatures, feed it broad clean telemetry, retrain it as the environment drifts, and it earns its place: not by replacing the analyst, but by deciding what reaches them.
Frequently asked questions
<p>Behavioral analytics is a detection approach that builds a baseline of normal activity for each user and entity, then scores activity that deviates from that baseline as potential risk. Instead of matching known-bad patterns, it asks whether an entity is acting unlike itself. Its packaged form is User and Entity Behavior Analytics (UEBA), and it is built to catch insider misuse, account takeover, and attacks that have no signature.</p>
<p>They are effectively the same thing in modern usage. Behavioral analytics is the general approach of detecting deviations from learned-normal behavior. UEBA (User and Entity Behavior Analytics) is the security product category Gartner named around 2015 that implements it, extending earlier User Behavior Analytics (UBA) to cover non-human entities like service accounts, hosts, and applications, not just users.</p>
<p>Signature-based detection matches activity against a library of known-bad patterns such as file hashes, IOCs, and rules, so it only catches what has already been catalogued. Behavioral analytics carries a model of normal and flags deviations from it, so it catches insider misuse, account takeover, and novel attacks that have no signature. Signatures produce fewer false positives but miss the unknown; behavioral analytics catches the unknown at the cost of more false positives. Mature programs run both.</p>
<p>The standard sources are authentication and identity logs (Active Directory, SSO, VPN), endpoint EDR telemetry (process and file activity), network data (flows, DNS, east-west traffic), cloud and SaaS logs (API calls, IAM changes, storage access), and data-access logs (file shares, databases, DLP). The strength of the approach comes from correlating these signals, since events that look benign alone can form a clear pattern together.</p>
<p>Yes. The scale and dimensionality of profiling every user and entity defeats static rules, so behavioral analytics uses machine learning: unsupervised models that cluster normal activity and surface outliers, supervised models trained on labelled incidents, and peer-group analysis that benchmarks an entity against its cohort. This use of statistical and ML models, rather than rules alone, is what distinguishes true UEBA from basic rule-based correlation.</p>
<p>The four canonical use cases are insider threat detection, compromised account or account-takeover detection, lateral movement detection, and data exfiltration detection. All four share a trait that makes signatures ineffective: the activity often uses valid credentials and legitimate tools, so the only reliable signal is how far the behavior deviates from the entity's normal baseline.</p>