Threat Hunting and Detection with Web Proxy Logs

Published in Blu Raven · Aug 12, 2020

As I mentioned in my previous post about detecting and responding to ransomware attacks, I created a hunting and detection guide using web proxy logs.

Web proxies generate a common set of fields that can be used for threat hunting and detection: Duration, HTTP Status, Bytes In, Bytes Out, Protocol, HTTP Method, HTTP Version, URL Category, URL Hostname, URL Path, URL Query, Mime Type, File Name, and User-Agent.

Below, I explain how we can use this information to hunt or detect threats.

Duration

This field shows how long the transaction took. Malware can communicate with its C2 server over HTTP(S). When it does, it asks for commands periodically. The interval doesn't have to be constant, such as every 10 minutes; malware can add jitter to make its requests look random. Malware can also keep the connection open instead. Either way, it needs to ask for commands very often or keep the connection open.

Technique

Calculate the sum of Duration per SourceIP-DestinationIP pair over 12/24 hours.

What to look for

Higher values may indicate beaconing. Keep in mind that not all beacons are malicious. That's why we are hunting.
Note: If you apply the same method to your public websites, you can detect web scraping or customer data scraping.
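To make the technique concrete, here is a minimal Python sketch of the summing step. The record layout and the field names (src_ip, dst_ip, duration) are my assumptions for illustration, not any specific proxy's log schema, and the records are assumed to be already filtered to the 12- or 24-hour window.

```python
from collections import defaultdict

def total_duration_per_pair(records):
    """Sum transaction duration (seconds) per (source IP, destination IP) pair."""
    totals = defaultdict(float)
    for rec in records:
        totals[(rec["src_ip"], rec["dst_ip"])] += rec["duration"]
    # Largest totals first: these pairs are the ones worth a closer look.
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical records (field names are assumptions).
sample = [
    {"src_ip": "10.0.0.5", "dst_ip": "203.0.113.7", "duration": 30.0},
    {"src_ip": "10.0.0.5", "dst_ip": "203.0.113.7", "duration": 31.5},
    {"src_ip": "10.0.0.8", "dst_ip": "198.51.100.2", "duration": 0.4},
]
ranked = total_duration_per_pair(sample)
print(ranked[0])
```

A high total only ranks a pair for review; whether it is malicious is still a judgment call for the hunter.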

HTTP Status

Users visit websites, post something, sometimes upload data, or download a file. Under normal conditions, these transactions return HTTP 200. Malware, however, can use HTTP error codes as a C2 channel. Also, many malware families use a DGA (domain generation algorithm) to keep the connection alive when one of the domains is blocked. In that case, the malware keeps getting HTTP errors and tries the next domain.

Technique

Calculate the total count of the HTTP Status Codes per SourceIP or per SourceIP-DestinationIP over a specific time period.
List URLs having only HTTP Errors.

What to look for

Higher values of an uncommon HTTP Status Code may indicate C2 activity.
Higher values of HTTP errors for a website can indicate failed C2 activity.
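Both techniques can be sketched in a few lines of Python. The tuple layout (source IP, URL, status) and the hostnames are hypothetical, and treating every status of 400 or above as an "error" is a simplification I chose for this example.

```python
from collections import Counter, defaultdict

def status_counts(records):
    """Count occurrences of each HTTP status code per source IP."""
    return Counter((src_ip, status) for src_ip, _url, status in records)

def error_only_urls(records):
    """Return URLs for which every observed response was an error (>= 400)."""
    outcomes = defaultdict(set)
    for _src_ip, url, status in records:
        outcomes[url].add(status >= 400)
    return sorted(url for url, flags in outcomes.items() if flags == {True})

# Hypothetical records: (source IP, URL, HTTP status).
sample = [
    ("10.0.0.5", "http://news.example/", 200),
    ("10.0.0.5", "http://news.example/", 404),
    ("10.0.0.5", "http://qwkzjx.example/gate", 404),
    ("10.0.0.5", "http://pmvbtr.example/gate", 503),
    ("10.0.0.8", "http://news.example/", 200),
]
counts = status_counts(sample)
failing = error_only_urls(sample)
print(counts.most_common(3))
print(failing)
```

A site that sometimes succeeds drops out of the error-only list, which leaves the destinations that never answered successfully, the pattern a DGA cycling through blocked or unregistered domains would tend to produce.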

Bytes In

Under normal conditions, when a user visits a website, downloads a file, etc., each transaction has a different size. Malware, on the other hand, visits the same page every time, so the downloaded content has the same size unless the attacker starts interacting with the victim machine.

Technique

Calculate the count of each BytesIn value per Source-Destination pair over 12/24 hours. Your chances are best while the attackers sleep, as there is no interaction.
Calculate the ratio of count(BytesIn) per Source-Destination pair. This compares attacker interaction against idle status.

What to look for

Higher values may indicate beaconing. C2 servers reply with the same data, making the Bytes In value the same.
Higher values of ratio may indicate C2 beaconing.
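One way to read the count and ratio steps is sketched below in Python. Here I assume the ratio means "requests that returned the most common Bytes In value, divided by all requests for that pair"; that interpretation, like the tuple layout, is my assumption for the example.

```python
from collections import Counter, defaultdict

def bytes_in_repetition(records):
    """For each (source, destination) pair, report the most frequent
    response size, how often it occurred, and its share of all requests."""
    sizes = defaultdict(Counter)
    for src_ip, dst_ip, bytes_in in records:
        sizes[(src_ip, dst_ip)][bytes_in] += 1
    stats = {}
    for pair, counter in sizes.items():
        top_size, hits = counter.most_common(1)[0]
        total = sum(counter.values())
        stats[pair] = {"top_size": top_size, "hits": hits, "ratio": hits / total}
    return stats

# Hypothetical records: (source IP, destination IP, bytes received).
sample = (
    [("10.0.0.5", "203.0.113.7", 512)] * 4        # idle check-ins, same size
    + [("10.0.0.5", "203.0.113.7", 2048)]         # one larger response
    + [("10.0.0.8", "198.51.100.2", size) for size in (1200, 48000, 530)]
)
stats = bytes_in_repetition(sample)
print(stats[("10.0.0.5", "203.0.113.7")])
```

In this sample the first pair gets a ratio of 0.8 because four of its five responses have the same size, while the browsing-like pair never repeats a size.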

Bytes Out

Normal user activity consists mostly of downloading data. Uploaded data is usually small unless a file or other data is uploaded to a website.

Technique

Calculate the sum of BytesOut per Source-Destination pair over 12/24 hours.
Calculate the ratio of count(BytesOut) per Source-Destination pair over 12/24 hours.

What to look for

Higher values may indicate data exfiltration.
Higher values of ratio may indicate beaconing.
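For the sum, a short Python sketch that ranks the heaviest uploaders might look like this. The tuple layout and the cutoff n are hypothetical; the records are assumed to be limited to the time window already.

```python
import heapq
from collections import defaultdict

def top_uploaders(records, n=10):
    """Return the n (source, destination) pairs with the most bytes sent out."""
    totals = defaultdict(int)
    for src_ip, dst_ip, bytes_out in records:
        totals[(src_ip, dst_ip)] += bytes_out
    return heapq.nlargest(n, totals.items(), key=lambda kv: kv[1])

# Hypothetical records: (source IP, destination IP, bytes sent).
sample = [
    ("10.0.0.5", "203.0.113.7", 4_000_000),
    ("10.0.0.5", "203.0.113.7", 6_500_000),
    ("10.0.0.8", "198.51.100.2", 900),
    ("10.0.0.9", "198.51.100.2", 1_200),
]
leaders = top_uploaders(sample, n=2)
print(leaders)
```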

HTTP Method

Under normal circumstances, a user's web traffic contains a large number of HTTP GET requests and a small number of HTTP POST requests. Other HTTP methods, such as HTTP PUT, are expected to be seen even less often.

Technique

Calculate the ratio of POST or PUT requests to GET requests per Source-Destination pair over 4/8/12/24 hours.

What to look for

Higher values of ratio may indicate beaconing or exfiltration.
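The ratio can be sketched in Python as follows. The tuple layout is hypothetical, and reporting infinity for a pair that sends POST or PUT requests but never a GET is a design choice of mine so that those pairs sort to the top.

```python
from collections import Counter, defaultdict

def write_to_get_ratio(records):
    """Ratio of (POST + PUT) requests to GET requests per (source, destination)."""
    methods = defaultdict(Counter)
    for src_ip, dst_ip, method in records:
        methods[(src_ip, dst_ip)][method.upper()] += 1
    ratios = {}
    for pair, counter in methods.items():
        writes = counter["POST"] + counter["PUT"]  # missing keys count as 0
        gets = counter["GET"]
        if gets:
            ratios[pair] = writes / gets
        else:
            # No GETs at all: infinite ratio if anything was posted, else 0.
            ratios[pair] = float("inf") if writes else 0.0
    return ratios

# Hypothetical records: (source IP, destination IP, HTTP method).
sample = (
    [("10.0.0.5", "203.0.113.7", "POST")] * 4
    + [("10.0.0.5", "203.0.113.7", "GET")]
    + [("10.0.0.8", "198.51.100.2", "GET")] * 3
    + [("10.0.0.8", "198.51.100.2", "POST")]
    + [("10.0.0.9", "192.0.2.10", "post")]
)
ratios = write_to_get_ratio(sample)
print(sorted(ratios.items(), key=lambda kv: kv[1], reverse=True))
```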

URL Hostname

Usually, a user visits websites that are in the top 1M list. In some cases, an unpopular website can be visited by lots of users as well (think about 3rd parties having business with the company).

Technique

Compare with top 1M domains and calculate the hit count.
Calculate hit count per Hostname.

What to look for

A hit count below 5 for a hostname that is not in the top 1M may indicate malicious payload delivery.
A small hit count may indicate malicious payload delivery.
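A rough Python sketch of the comparison is below. The hostnames and the tiny stand-in for the top 1M list are made up, and the exact-match lookup is a simplification: a real hunt would likely reduce each hostname to its registered domain before checking the list.

```python
from collections import Counter

def rare_unlisted_hosts(hostnames, top_domains, max_hits=5):
    """Return hostnames seen fewer than max_hits times that are not in top_domains."""
    hits = Counter(host.lower() for host in hostnames)
    return {
        host: count
        for host, count in hits.items()
        if count < max_hits and host not in top_domains
    }

# Hypothetical data: one popular site, one busy third-party partner site,
# and one rarely visited host that is not on the list.
observed = (
    ["www.google.com"] * 40
    + ["intranet-partner.example"] * 12
    + ["cdn-update.example"] * 2
)
top_list = {"www.google.com"}  # stand-in for a real top 1M list
suspects = rare_unlisted_hosts(observed, top_list)
print(suspects)
```

The partner site is not on the list but is visited often, so it is left out, which matches the third-party case mentioned above.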

URL Path

When an attacker compromises a website and uses it as a C2 server, the malware most probably uses the same URL Path for C2 communication.

Technique

Calculate the count per Source-Destination-URLPath combination.

What to look for

Higher values may indicate beaconing.
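If your proxy logs the full URL rather than a separate path field, the path can be split out before counting. This is a minimal Python sketch; the tuple layout and the URLs are hypothetical, and the hostname stands in for the destination.

```python
from collections import Counter
from urllib.parse import urlsplit

def path_hits(records):
    """Count requests per (source IP, destination host, URL path)."""
    counts = Counter()
    for src_ip, url in records:
        parts = urlsplit(url)
        # The query string is ignored, so varying parameters still group together.
        counts[(src_ip, parts.hostname, parts.path)] += 1
    return counts

# Hypothetical records: (source IP, full URL).
sample = [
    ("10.0.0.5", "http://blog.example/wp-content/themes/a.php?id=1"),
    ("10.0.0.5", "http://blog.example/wp-content/themes/a.php?id=2"),
    ("10.0.0.5", "http://blog.example/wp-content/themes/a.php?id=3"),
    ("10.0.0.8", "http://blog.example/about"),
]
hits = path_hits(sample)
print(hits.most_common(1))
```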

URL Query

URL query information is seen when you search for an item on a website. Malware does the same when asking the C2 server if there is anything to run on the victim machine. The query can be encoded/encrypted as well.

Technique

Calculate count per Source-Destination-URLQuery.
Calculate the length of URLQuery.
Look for base64 encoded strings in URLQuery.

What to look for

Higher counts may indicate beaconing.
Longer queries may indicate encoded data, a sign of exfiltration or beaconing.
Encoded strings may indicate beaconing or exfiltration.
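The length and base64 checks can be sketched in Python like this. The 16-character minimum and the sample query are my assumptions, and the check is only a heuristic: any long run of letters and digits whose length is a multiple of four will also decode, so expect false positives such as session tokens.

```python
import base64
import binascii
import re
from urllib.parse import unquote

# Runs of 16+ base64 characters (standard or URL-safe), optionally padded.
TOKEN = re.compile(r"[A-Za-z0-9+/_-]{16,}={0,2}")

def base64_candidates(query):
    """Return substrings of a URL query that decode cleanly as base64."""
    found = []
    for token in TOKEN.findall(unquote(query)):
        # Map the URL-safe alphabet onto the standard one before decoding.
        normalized = token.replace("-", "+").replace("_", "/")
        if len(normalized) % 4 != 0:
            continue  # unpadded or truncated values are skipped in this sketch
        try:
            base64.b64decode(normalized, validate=True)
        except (binascii.Error, ValueError):
            continue
        found.append(token)
    return found

# Hypothetical query carrying an encoded host and user name.
payload = base64.b64encode(b"hostname=WS01;user=alice").decode()
sample_query = "id=7&data=" + payload
hits = base64_candidates(sample_query)
print(len(sample_query), hits)
```

An ordinary search query such as "q=shoes&page=2" yields no candidates because none of its values is long enough.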

I hope you find this guide useful for your hunts! I've also created a cheat sheet and shared it in my GitHub repo. You can find the PDF version here.

Thanks for reading this article! If you have any questions, leave a comment below. Want to master KQL for Threat Hunting, Detection Engineering, and DFIR in a hyper-realistic environment? Visit my academy for a free course!

Mehmet is the founder of Blu Raven Academy. He brings over 15 years of experience in cybersecurity, with a unique blend of expertise in KQL, threat hunting, detection engineering, and data science to his courses to help others advance their skills. Recognized four times as a Microsoft Security MVP, he is renowned for adapting the RITA beacon analyzer to KQL, developing novel methods for detecting threats, and for his insightful presentations at key conferences like the SANS DFIR Summit.


Written by Mehmet Ergene