Remote Code Execution for Java Developers

Published in

cisco-amp-technology

9 min readApr 23, 2019

If you want to go straight to code or prefer to read this in markdown, head to our repo.

This post is meant to be a gateway of sorts for Java developers into security. Many topics and attacks in computer security rely on intimate knowledge of some unfamiliar technology (e.g. computer architecture, asm). Therefore, it is sometimes a little overwhelming. We will see that security bugs and exploits leading to remote code execution are possible with relatively simple Java code.

First, we will recap some relevant Java features (polymorphism, serialization and reflection). Then we’ll dive into a demo that demonstrates a particular Java security hole which uses these features. Finally, we’ll talk about lessons which can be applied to make your code more secure.

Java Feature Review

Polymorphism

Polymorphism or “one interface, many implementations” is a major feature of object-oriented programming languages. Java supports this behavior with interfaces, abstract classes and concrete classes.

The java.util.Map interface is a good example to study. This interface defines method signatures that a class must implement in order to be considered a Map. The Java standard library includes a few implementations of this interface like java.util.HashMap or its thread-safe equivalent java.util.concurrent.ConcurrentHashMap. Again, one interface, many implementations.

We could even make our own Map implementation.

If we find that IntegerToStringMap has functionality that we want to reuse then we can extend it to make more Map implementations.

What if we want to prevent this? Java allows you to specify that a concrete class should not be extended with the final keyword.

This would stop AnotherMap and YetAnotherMap from being accepted by the Java compiler or the JVM.

How about using polymorphic classes? Continuing with the Map example, polymorphism in Java allows us to write code like this:

This is particularly useful since we can write the useMap method without caring which Map implementation is passed into the method.

Serialization

Serialization is the act of converting structured data (objects in Java) to an array of bytes. A program should then be able to recover the structured data through a reversal of the process (deserialization). There are standard techniques to help since serialization is very common. Java includes its own mechanism for serialization through the java.io.Serializable interface and the java.io.ObjectOutputStream and java.io.ObjectInputStream classes.

If you create a class that implements Serializable like this.

It can be serialized and deserialized like this.

In the above code, serialization will be attempted for the entire object graph stemming from example1. Everything in the object graph needs to be of a type the implements Serializable or a primitive type (e.g. long or byte[]). The Example class has a single Integer data field. Integer implements Serializable, and Integer contains a single primitive int field. Therefore this condition is met for the example1 object graph, and serialization should succeed.

Reflection

Reflection is probably the most difficult aspect of this tutorial. It is a somewhat advanced feature set that usually isn’t needed when making Java applications. At Java One 2016, I remember Mark Reinhold and Alex Buckley asking a room of Java developers if they use the Java reflection API — most kept their hands down.

Reflection is not needed in the demo server code that a Java application developer would write. However, we will use reflection to create the exploit.

Reflection is a type of metaprogramming that lets you obtain information on and even modify parts of your program at runtime. A simple use of reflection is annotation processing where you are obtaining information about your program. Assume we have the following annotation definition and usage.

You would then use code like this to extract the annotation value from TestClass at runtime.

In general, you can do some pretty funky things with the reflection API. Another example, which we’ll apply later, is implementing an interface at runtime. Here is how you can implement a java.util.Collection with reflection.

The dummy lambda in the above snippet implements the java.lang.reflect.InvocationHandler interface. It is code that will be called on every method invocation and it must decide how to handle each different method call.

Remote Code Execution Demo

Server Setup

This demo is built around a web server that accepts and deserializes a submission. The submission class isn’t really anything special. Here it is, com.cisco.amp.server.Submission, stripped of comments.

This class implements Serializable to enable sending and receiving Submission instances over a network. Therefore the server can take bytes from HTTP requests and deserialize them to Submission instances. The submit method in com.cisco.amp.server.SubmissionController does this with the following code.

There would probably be a bunch of other logic in a real application. This serves as a fairly realistic example though. We deserialize the incoming bytes, do something with the object (call toString in this case) and return something back in the HTTP response.

However this small amount of server-side code is enough to introduce a vulnerability. The sin we committed is not validating user input. We take bytes that the user sends us and trust that they are what we expect them to be — a well-behaved instance of Submission.

Exploit Development

We will develop a client that crafts and sends an exploit to the server. Our goal will be to make the server launch the calculator application. This is a pretty classic example. The idea is if you can get calculator to launch, then you could launch whatever other program you wanted to. For example, launching some sort of remote shell application would be useful to an attacker.

By crafting a special submission (our exploit), we should be able to abuse the inherent trust that the server gives to submissions. This is where polymorphism and reflection come into play. We will be able to trick the the server into executing code we dictate by using these two concepts.

Polymorphic Exploit (Attempt)
First, notice that the Submission class has a Collection<String> member. Since Collection is an interface, it is effectively saying that it doesn't care what type of Collection it is - any implementation will do. This makes some sense since the result will be the same with ArrayList<String>, HashSet<String> and other normal implementations of Collection. What if it isn't a normal collection implementation? What if an attacker supplied a custom collection with the behavior they want? That is exactly what we will try to do.

The rest of this post will use client code that looks like this (full source in our repo).

That is, create a Submission instance, serialize it and send it to the server. The interesting stuff, which we'll talk about now, is the contents of the makeExploitCollection method.

First, we should note that the server will need to call whatever custom code we put into our exploit Collection. In this case, a Collection method is called by the server that we can override. Note that the server calls Submission::toString which looks like this.

The for-each syntax above, for (String entry : values), calls Collection::iterator behind the scenes. Therefore if we implement Collection::iterator with custom code, the server should run it. This is simple enough if we extend ArrayList<String> and override that one method. Note that ArrayList implements Serializable so our extension will as well. The following code should launch calculator when the server runs it.

However, we hit a snag if we try to send this exploit to the server. The server prints an exception with a stacktrace.

The name com.cisco.amp.client.Client$1 was given to the anonymous class we created in our client. This is the server saying that it can't find the bytecode for com.cisco.amp.client.Client$1.

Let’s take a look at what we sent to the server. This is a String rendering of the exploit attempt.

We can see the references to classes we use, along with a little bit of data. The bytecode for these classes wasn’t sent along though. Java deserialization uses classloaders to try finding the bytecode for these classes. In this case, a java.net.URLClassLoader instance is searching through jar files on the classpath to find com.cisco.amp.client.Client$1. It throws the above exception when it fails to find the class.

This means our exploit won’t work in its current form. The server needs to be able to access and execute our exploit code for it to work. This can be accomplished by using classes that the server already has. In the next section we’ll see how reflection can make the exploit work in this manner.

Polymorphic and Reflective Exploit
Going forward, we will assume that the following dependency is present in the server.

This is a little contrived, but not unreasonable considering that modern web apps are usually library soup. Additionally, bringing the dependency in will make this section much simpler.

The idea here is to use reflection to implement our exploit Collection in a manner similar to the reflection recap section. Therefore, the exploit creation method will look something like this.

Here the Collection is reflectively implemented by java.lang.reflect.Proxy. This should work because Proxy implements Serializable, and it will be on the server's classpath (all of the Java standard library is — usually). We still need an InvocationHandler implementation though.

Remember that we can’t just make our own implementation of one. We can only use code on the server for this exploit. This is where the groovy-all dependency comes in. It contains two very useful classes: org.codehaus.groovy.runtime.ConvertedClosure and org.codehaus.groovy.runtime.MethodClosure. ConvertedClosure implements InvocationHandler, and it facilitates the reflective implementation of a class method with a Closure (like a Java lambda) you construct it with. MethodClosure provides a Closure implementation for running a system command (like launching calculator). They both implement Serializable.

Now, our reflective Collection implementation, with a custom Collection::iterator method, can be constructed like this.

Note that we are not creating new code for the server to execute. We are combining classes it already has.

All the demo code is in our repo. If you run the demo, the server will launch calculator. When you run it, there is another exception printed to the server logs even though the exploit works. An attacker would need a better exploit to avoid the exception printing (if they cared about stealth).

Server Code Improvements

We have seen the path to successfully exploit the demo server. After going through an exercise like this, we can better understand what would have stopped or made the attack more difficult. We’ll go through some possible changes here and briefly describe how they could benefit the demo server.

Validate User Input

The big sin committed in the server code was not validating user input. Generally, this isn’t something that you want to do yourself. Using a library or framework will give much better results as there will be edge cases that you don’t think of. However, in this scenario a couple things that might help are:

Only accept one specific collection implementation.
Ensure the Collection implementation and Submission are declared final in their class definitions, so they can not be extended.
Don’t use generics in the definition of concrete classes that will be serialized. We didn’t see why in this exercise, but you can probably figure it out after reading about Java type erasure.
This list is not exhaustive by any means.

These suggestions focus on preventing an attacker from supplying a class of their own design. Input validation is an extremely important measure to take in general though. Proper input validation can safe guard against other common attacks (e.g. SQL injection).

Avoid Java Serialization

This ties into validating user input. Java Serialization is a really powerful serialization technique with many features. It is often overkill, and a more restrictive serialization method (e.g. JSON) would usually work just as well.

Using and validating against a more restrictive serialization standard gives an attacker less wiggle room. In the demo, a JSON document containing an array would allow us to accept a collection of Strings in a much safer manner. Additionally, it looks like this will be required sooner or later, as Java maintainers want to remove Java serialization.

Better Manage Dependencies

In the demo, we used classes from groovy-all to craft our exploit. This was an unnecessary dependency for our server, which means it should be removed. Removing unnecessary dependencies gives an attacker less to work with. You can even remove parts of the Java standard library, starting in Java 9, by creating a custom Java runtime.

If a dependency is needed, then it should be kept up to date. Generally, the latest bug fix release will do, as long as the major version being used is still supported. This also applies to the groovy-all dependency. Newer versions contain safeguards so ConvertedClosure and MethodClosure can't be abused like in the demo. You can read about the groovy changes here.

Use Minimal Permissions

If you run the demo and look at a process tree listing, then it will look something like this.

Calculator is launched by the server, and it is running as the same user the server runs as. In this case, it is my personal account, so an attacker could do as much damage as I personally can. If the server were running as root, the attacker could do more. If the server had its own dedicated account, then the attacker would be able to do much less.

Originally published at github.com.