Uploading Vulnerabilities
The ability to upload files to a server has become an integral part of how we interact with web applications. Be it a profile picture for a social media website, a report being uploaded to cloud storage, or saving a project on Github; the applications for file upload features are limitless.
Unfortunately, when handled badly, file uploads can also open up severe vulnerabilities in the server. This can lead to anything from relatively minor, nuisance problems; all the way up to full Remote Code Execution (RCE) if an attacker manages to upload and execute a shell. With unrestricted upload access to a server (and the ability to retrieve data at will), an attacker could deface or otherwise alter existing content — up to and including injecting malicious webpages, which lead to further vulnerabilities such as XSS or CSRF. By uploading arbitrary files, an attacker could potentially also use the server to host and/or serve illegal content, or to leak sensitive information. Realistically speaking, an attacker with the ability to upload a file of their choice to your server — with no restrictions — is very dangerous indeed.
- Overwriting existing files on a server
- Uploading and Executing Shells on a server
- Bypassing Client-Side filtering
- Bypassing various kinds of Server-Side filtering
- Fooling content type validation checks
So, we have a file upload point on a site. How would we go about exploiting it?
As with any kind of hacking, enumeration is key. The more we understand about our environment, the more we’re able to do with it. Looking at the source code for the page is good to see if any kind of client-side filtering is being applied. Scanning with a directory bruteforcer such as Gobuster is usually helpful in web attacks, and may reveal where files are being uploaded to; Gobuster is no longer installed by default on Kali, but can be installed with sudo apt install gobuster
. Intercepting upload requests with Burpsuite will also come in handy. Browser extensions such as Wappalyser can provide valuable information at a glance about the site you're targetting.
With a basic understanding of how the website might be handling our input, we can then try to poke around and see what we can and can’t upload. If the website is employing client-side filtering then we can easily look at the code for the filter and look to bypass it . If the website has server-side filtering in place then we may need to take a guess at what the filter is looking for, upload a file, then try something slightly different based on the error message if the upload fails. Uploading files designed to provoke errors can help with this. Tools like Burpsuite or OWASP Zap can be very helpful at this stage.
When files are uploaded to the server, a range of checks should be carried out to ensure that the file will not overwrite anything which already exists on the server. Common practice is to assign the file with a new name — often either random, or with the date and time of upload added to the start or end of the original filename. Alternatively, checks may be applied to see if the filename already exists on the server; if a file with the same name already exists then the server will return an error message asking the user to pick a different file name. File permissions also come into play when protecting existing files from being overwritten. Web pages, for example, should not be writeable to the web user, thus preventing them from being overwritten with a malicious version uploaded by an attacker.
If, however, no such precautions are taken, then we might potentially be able to overwrite existing files on the server. Realistically speaking, the chances are that file permissions on the server will prevent this from being a serious vulnerability. That said, it could still be quite the nuisance, and is worth keeping an eye out for in a pentest or bug hunting environment.
Remote Code Execution
Remote Code Execution (as the name suggests) would allow us to execute code arbitrarily on the web server. Whilst this is likely to be as a low-privileged web user account (such as www-data
on Linux servers), it's still an extremely serious vulnerability. Remote code execution through a web application tends to be a result of uploading a program written in the same language as the back-end of the website (or another language which the server understands and will execute). Traditionally this would be PHP, however, in more recent times, other back-end languages have become more common (Python Django and Javascript in the form of Node.js being prime examples).
There are two basic ways to achieve RCE on a webserver: webshells, and reverse shells. Realistically a fully featured reverse shell is the ideal goal for an attacker; however, a webshell may be the only option available (for example, if a file length limit has been imposed on uploads). We’ll take a look at each of these in turn. As a general methodology, we would be looking to upload a shell of one kind or another, then activating it, either by navigating directly to the file if the server allows it, or by otherwise forcing the webapp to run the script for us.
Note that when using webshells, it’s usually easier to view the output by looking at the source code of the page. This drastically improves the formatting of the output.
Filtering
Up until now we have largely been ignoring the counter-defences employed by web developers to defend against file upload vulnerabilities. From here on out, we’ll be looking at some of the defence mechanisms used to prevent malicious file uploads, and how to circumvent them.
First up, let’s discuss the differences between client-side filtering and server-side filtering.
When we talk about a script being “Client-Side”, in the context of web applications, we mean that it’s running in the user’s browser as opposed to on the web server itself. JavaScript is pretty much ubiquitous as the client-side scripting language, although alternatives do exist. Regardless of the language being used, a client-side script will be run in your web browser. In the context of file-uploads, this means that the filtering occurs before the file is even uploaded to the server. Theoretically, this would seem like a good thing, right? In an ideal world, it would be; however, because the filtering is happening on our computer, it is trivially easy to bypass. As such client-side filtering by itself is a highly insecure method of verifying that an uploaded file is not malicious.
Conversely, as you may have guessed, a server-side script will be run on the server. Traditionally PHP was the predominant server-side language (with Microsoft’s ASP for IIS coming in close second); however, in recent years, other options (C#, Node.js, Python, Ruby on Rails, and a variety of others) have become more widely used. Server-side filtering tends to be more difficult to bypass, as you don’t have the code in front of you. As the code is executed on the server, in most cases it will also be impossible to bypass the filter completely; instead we have to form a payload which conforms to the filters in place, but still allows us to execute our code.
With that in mind, let’s take a look at some different kinds of filtering.
Extension Validation:
File extensions are used (in theory) to identify the contents of a file. In practice they are very easy to change, so actually don’t mean much; however, MS Windows still uses them to identify file types, although Unix based systems tend to rely on other methods, which we’ll cover in a bit. Filters that check for extensions work in one of two ways. They either blacklist extensions (i.e. have a list of extensions which are not allowed) or they whitelist extensions (i.e. have a list of extensions which are allowed, and reject everything else).
File Type Filtering:
Similar to Extension validation, but more intensive, file type filtering looks, once again, to verify that the contents of a file are acceptable to upload. We’ll be looking at two types of file type validation:
- MIME validation: MIME (Multipurpose Internet Mail Extension) types are used as an identifier for files — originally when transfered as attachments over email, but now also when files are being transferred over HTTP(S). The MIME type for a file upload is attached in the header of the request, and looks something like this:
- MIME types follow the format <type>/<subtype>. In the request above, you can see that the image “spaniel.jpg” was uploaded to the server. As a legitimate JPEG image, the MIME type for this upload was “image/jpeg”. The MIME type for a file can be checked client-side and/or server-side; however, as MIME is based on the extension of the file, this is extremely easy to bypass.
- Magic Number validation: Magic numbers are the more accurate way of determining the contents of a file; although, they are by no means impossible to fake. The “magic number” of a file is a string of bytes at the very beginning of the file content which identify the content. For example, a PNG file would have these bytes at the very top of the file:
89 50 4E 47 0D 0A 1A 0A
.
- Unlike Windows, Unix systems use magic numbers for identifying files; however, when dealing with file uploads, it is possible to check the magic number of the uploaded file to ensure that it is safe to accept. This is by no means a guaranteed solution, but it’s more effective than checking the extension of a file.
File Length Filtering:
File length filters are used to prevent huge files from being uploaded to the server via an upload form (as this can potentially starve the server of resources). In most cases this will not cause us any issues when we upload shells; however, it’s worth bearing in mind that if an upload form only expects a very small file to be uploaded, there may be a length filter in place to ensure that the file length requirement is adhered to.
File Name Filtering:
As touched upon previously, files uploaded to a server should be unique. Usually this would mean adding a random aspect to the file name, however, an alternative strategy would be to check if a file with the same name already exists on the server, and give the user an error if so. Additionally, file names should be sanitised on upload to ensure that they don’t contain any “bad characters”, which could potentially cause problems on the file system when uploaded (e.g. null bytes or forward slashes on Linux, as well as control characters such as ;
and potentially unicode characters). What this means for us is that, on a well administered system, our uploaded files are unlikely to have the same name we gave them before uploading, so be aware that you may have to go hunting for your shell in the event that you manage to bypass the content filtering.
File Content Filtering:
More complicated filtering systems may scan the full contents of an uploaded file to ensure that it’s not spoofing its extension, MIME type and Magic Number. This is a significantly more complex process than the majority of basic filtration systems employ.
It’s worth noting that none of these filters are perfect by themselves — they will usually be used in conjunction with each other, providing a multi-layered filter, thus increasing the security of the upload significantly. Any of these filters can all be applied client-side, server-side, or both.
Similarly, different frameworks and languages come with their own inherent methods of filtering and validating uploaded files. As a result, it is possible for language specific exploits to appear; for example, until PHP major version five, it was possible to bypass an extension filter by appending a null byte, followed by a valid extension, to the malicious .php
file. More recently it was also possible to inject PHP code into the exif data of an otherwise valid image file, then force the server to execute it. These are things that you are welcome to research further, should you be interested.
Bypassing Client-Side Filtering
We’ll begin with the first (and weakest) line of defence: Client-Side Filtering.
As mentioned previously, client-side filtering tends to be extremely easy to bypass, as it occurs entirely on a machine that you control. When you have access to the code, it’s very easy to alter it.
There are four easy ways to bypass your average client-side file upload filter:
- Turn off Javascript in your browser — this will work provided the site doesn’t require Javascript in order to provide basic functionality. If turning off Javascript completely will prevent the site from working at all then one of the other methods would be more desirable; otherwise, this can be an effective way of completely bypassing the client-side filter.
- Intercept and modify the incoming page. Using Burpsuite, we can intercept the incoming web page and strip out the Javascript filter before it has a chance to run. The process for this will be covered below.
- Intercept and modify the file upload. Where the previous method works before the webpage is loaded, this method allows the web page to load as normal, but intercepts the file upload after it’s already passed (and been accepted by the filter).
- Send the file directly to the upload point. Why use the webpage with the filter, when you can send the file directly using a tool like
curl
? Posting the data directly to the page which contains the code for handling the file upload is another effective method for completely bypassing a client side filter. The syntax for such a command would look something like this:curl -X POST -F "submit:<value>" -F "<file-parameter>:@<path-to-file>" <site>
. To use this method you would first aim to intercept a successful upload (using Burpsuite or the browser console) to see the parameters being used in the upload, which can then be slotted into the above command.
We will be covering methods two and three in depth below.
Let’s assume that, once again, we have found an upload page on a website:
As always, we’ll take a look at the source code. Here we see a basic Javascript function checking for the MIME type of uploaded files:
In this instance we can see that the filter is using a whitelist to exclude any MIME type that isn’t image/jpeg
.
Our next step is to attempt a file upload — as expected, if we choose a JPEG, the function accepts it. Anything else and the upload is rejected.
Having established this, let’s start Burpsuite and reload the page. We will see our own request to the site, but what we really want to see is the server’s response, so right click on the intercepted data, scroll down to “Do Intercept”, then select “Response to this request”:
When we click the “Forward” button at the top of the window, we will then see the server’s response to our request. Here we can delete, comment out, or otherwise break the Javascript function before it has a chance to load:
Having deleted the function, we once again click “Forward” until the site has finished loading, and are now free to upload any kind of file to the website:
It’s worth noting here that Burpsuite will not, by default, intercept any external Javascript files that the web page is loading. If you need to edit a script which is not inside the main page being loaded, you’ll need to go to the “Options” tab at the top of the Burpsuite window, then under the “Intercept Client Requests” section, edit the condition of the first line to remove ^js$|
:
We’ve already bypassed this filter by intercepting and removing it prior to the page being loaded, but let’s try doing it by uploading a file with a legitimate extension and MIME type, then intercepting and correcting the upload with Burpsuite.
Having reloaded the webpage to put the filter back in place, let’s take the reverse shell that we used before and rename it to be called “shell.jpg”. As the MIME type (based on the file extension) automatically checks out, the Client-Side filter lets our payload through without complaining:
Once again we’ll activate our Burpsuite intercept, then click “Upload” and catch the request:
Observe that the MIME type of our PHP shell is currently image/jpeg
. We'll change this to text/x-php
, and the file extension from .jpg
to .php
, then forward the request to the server:
Now, when we navigate to http://demo.uploadvulns.thm/uploads/shell.php
having set up a netcat listener, we receive a connection from the shell!
Bypassing Server-Side Filtering: File Extensions
Time to turn things up another notch!
Client-side filters are easy to bypass — you can see the code for them, even if it’s been obfuscated and needs processed before you can read it; but what happens when you can’t see or manipulate the code? Well, that’s a server-side filter. In short, we have to perform a lot of testing to build up an idea of what is or is not allowed through the filter, then gradually put together a payload which conforms to the restrictions.
Firstly we’ll take a look at a website that’s using a blacklist for file extensions as a server side filter. There are a variety of different ways that this could be coded, and the bypass we use is dependent on that. In the real world we wouldn’t be able to see the code for this, but for this example, it will be included here:
<?php
//Get the extension
$extension = pathinfo($_FILES["fileToUpload"]["name"])["extension"];
//Check the extension against the blacklist -- .php and .phtml
switch($extension){
case "php":
case "phtml":
case NULL:
$uploadFail = True;
break;
default:
$uploadFail = False;
}
?>
In this instance, the code is looking for the last period (.
) in the file name and uses that to confirm the extension, so that is what we'll be trying to bypass here. Other ways the code could be working include: searching for the first period in the file name, or splitting the file name at each period and checking to see if any blacklisted extensions show up. We'll cover this latter case later on, but in the meantime, let's focus on the code we've got here.
We can see that the code is filtering out the .php
and .phtml
extensions, so if we want to upload a PHP script we're going to have to find another extension. The wikipedia page for PHP gives us a bunch of options we can try -- many of them bypass the filter (which only blocks the two aforementioned extensions), but it appears that the server is configured not to recognise them as PHP files, as in the below example:
This is actually the default for Apache2 servers, at the time of writing; however, the sysadmin may have changed the default configuration (or the server may be out of date), so it’s well worth trying.
Eventually we find that the .phar
extension bypasses the filter -- and works -- thus giving us our shell:
Let’s have a look at another example, with a different filter. This time we’ll do it completely black-box: i.e. without the source code.
Once again, we have our upload form:
Ok, we’ll start by scoping this out with a completely legitimate upload. Let’s try uploading the spaniel.jpg
image from before:
Well, that tells us that JPEGS are accepted at least. Let’s go for one that we can be pretty sure will be rejected (shell.php
):
Can’t say that was unexpected.
From here we enumerate further, trying the techniques from above and just generally trying to get an idea of what the filter will accept or reject.
In this case we find that there are no shell extensions that both execute, and are not filtered, so it’s back to the drawing board.
In the previous example we saw that the code was using the pathinfo()
PHP function to get the last few characters after the .
, but what happens if it filters the input slightly differently?
Let’s try uploading a file called shell.jpg.php
. We already know that JPEG files are accepted, so what if the filter is just checking to see if the .jpg
file extension is somewhere within the input?
Pseudocode for this kind of filter may look something like this:
ACCEPT FILE FROM THE USER -- SAVE FILENAME IN VARIABLE userInput
IF STRING ".jpg" IS IN VARIABLE userInput:
SAVE THE FILE
ELSE:
RETURN ERROR MESSAGE
When we try to upload our file we get a success message. Navigating to the /uploads
directory confirms that the payload was successfully uploaded:
Activating it, we receive our shell:
This is by no means an exhaustive list of upload vulnerabilities related to file extensions. As with everything in hacking, we are looking to exploit flaws in code that others have written; this code may very well be uniquely written for the task at hand. This is the really important point to take away: there are a million different ways to implement the same feature when it comes to programming — your exploitation must be tailored to the filter at hand. The key to bypassing any kind of server side filter is to enumerate and see what is allowed, as well as what is blocked; then try to craft a payload which can pass the criteria the filter is looking for.
Bypassing Server-Side Filtering: Magic Numbers
We’ve already had a look at server-side extension filtering, but let’s also take the opportunity to see how magic number checking could be implemented as a server-side filter.
As mentioned previously, magic numbers are used as a more accurate identifier of files. The magic number of a file is a string of hex digits, and is always the very first thing in a file. Knowing this, it’s possible to use magic numbers to validate file uploads, simply by reading those first few bytes and comparing them against either a whitelist or a blacklist. Bear in mind that this technique can be very effective against a PHP based webserver; however, it can sometimes fail against other types of webserver (hint hint).
Let’s take a look at an example. As per usual, we have an upload page:
As expected, if we upload our standard shell.php file, we get an error; however, if we upload a JPEG, the website is fine with it. All running as per expected so far.
From the previous attempt at an upload, we know that JPEG files are accepted, so let’s try adding the JPEG magic number to the top of our shell.php
file. A quick look at the list of file signatures on Wikipedia shows us that there are several possible magic numbers of JPEG files. It shouldn't matter which we use here, so let's just pick one (FF D8 FF DB
). We could add the ASCII representation of these digits (ÿØÿÛ) directly to the top of the file but it's often easier to work directly with the hexadecimal representation, so let's cover that method.
Before we get started, let’s use the Linux file
command to check the file type of our shell:
As expected, the command tells us that the filetype is PHP. Keep this in mind as we proceed with the explanation.
We can see that the magic number we’ve chosen is four bytes long, so let’s open up the reverse shell script and add four random characters on the first line. These characters do not matter, so for this example we’ll just use four “A”s:
Save the file and exit. Next we’re going to reopen the file in hexeditor
(which comes by default on Kali), or any other tool which allows you to see and edit the shell as hex. In hexeditor the file looks like this:
Note the four bytes in the red box: they are all 41
, which is the hex code for a capital "A" -- exactly what we added at the top of the file previously.
Change this to the magic number we found earlier for JPEG files: FF D8 FF DB
Now if we save and exit the file (Ctrl + x), we can use file
once again, and see that we have successfully spoofed the filetype of our shell:
Perfect. Now let’s try uploading the modified shell and see if it bypasses the filter!
There we have it — we bypassed the server-side magic number filter and received a reverse shell.
Example Methodology
We’ve seen various different types of filter now — both client side and server side — as well as the general methodology for file upload attacks. let’s take the opportunity to discuss an example methodology for approaching this kind of challenge in a little more depth. You may develop your own alternative to this method, however, if you’re new to this kind of attack, you may find the following information useful.
We’ll look at this as a step-by-step process. Let’s say that we’ve been given a website to perform a security audit on.
- The first thing we would do is take a look at the website as a whole. Using browser extensions such as the aforementioned Wappalyzer (or by hand) we would look for indicators of what languages and frameworks the web application might have been built with. Be aware that Wappalyzer is not always 100% accurate. A good start to enumerating this manually would be by making a request to the website and intercepting the response with Burpsuite. Headers such as
server
orx-powered-by
can be used to gain information about the server. We would also be looking for vectors of attack, like, for example, an upload page. - Having found an upload page, we would then aim to inspect it further. Looking at the source code for client-side scripts to determine if there are any client-side filters to bypass would be a good thing to start with, as this is completely in our control.
- We would then attempt a completely innocent file upload. From here we would look to see how our file is accessed. In other words, can we access it directly in an uploads folder? Is it embedded in a page somewhere? What’s the naming scheme of the website? This is where tools such as Gobuster might come in if the location is not immediately obvious. This step is extremely important as it not only improves our knowledge of the virtual landscape we’re attacking, it also gives us a baseline “accepted” file which we can base further testing on.
- An important Gobuster switch here is the
-x
switch, which can be used to look for files with specific extensions. For example, if you added-x php,txt,html
to your Gobuster command, the tool would append.php
,.txt
, and.html
to each word in the selected wordlist, one at a time. This can be very useful if you've managed to upload a payload and the server is changing the name of uploaded files.
- Having ascertained how and where our uploaded files can be accessed, we would then attempt a malicious file upload, bypassing any client-side filters we found in step two. We would expect our upload to be stopped by a server side filter, but the error message that it gives us can be extremely useful in determining our next steps.
Assuming that our malicious file upload has been stopped by the server, here are some ways to ascertain what kind of server-side filter may be in place:
- If you can successfully upload a file with a totally invalid file extension (e.g.
testingimage.invalidfileextension
) then the chances are that the server is using an extension blacklist to filter out executable files. If this upload fails then any extension filter will be operating on a whitelist. - Try re-uploading your originally accepted innocent file, but this time change the magic number of the file to be something that you would expect to be filtered. If the upload fails then you know that the server is using a magic number based filter.
- As with the previous point, try to upload your innocent file, but intercept the request with Burpsuite and change the MIME type of the upload to something that you would expect to be filtered. If the upload fails then you know that the server is filtering based on MIME types.
- Enumerating file length filters is a case of uploading a small file, then uploading progressively bigger files until you hit the filter. At that point you’ll know what the acceptable limit is. If you’re very lucky then the error message of original upload may outright tell you what the size limit is. Be aware that a small file length limit may prevent you from uploading the reverse shell we’ve been using so far.