Developing Bitcoin algorithmic trading strategies

Published in

The Startup

11 min readMar 28, 2018

Developing algorithmic trading models and strategies is no simple task. To make matters worse the current state of crypto is highly volatile and rapidly changing. The market has become war zone due to regulations from the SEC and various governments targeting crypto exchanges. Despite all of the negative news, many traders are making it big in day-trading crypto assets.

In our previous post I wrote about our progress in making short-term predictions. In this post I take it to a next level. I will illustrate our efforts of how we are incorporating the predictions into a successful trading strategy.

Manual modelling

Developing an algorithmic model/strategy usually starts by looking at the raw data but more importantly analysing several indicators such as SMA, MACD, EMA and RSI. The whole point of the process is trying to find some patterns that are pretty obvious to the human eye, but also that these patterns are reoccurring throughout history and hopefully will continue to do so in the future.

In theory

Most strategies are based around making a profit, so people look for ways to detect a valley so they can trigger a “buy” signal, and then let the system wait until there’s a peak to trigger a “sell” signal. This is risky business, since there are no guarantees, i.e.: a system cannot accurately tell (given historical events) whether that market price is at a valley or not — it can only make a calculated guess. And that’s the whole point, we try to make a calculated guess, that is the plausibility of being at a valley/peak and triggering a trading signal (either buy or sell).

In reality

The reality is much more tricky than the theory. A system may indicate it’s at a valley, but a few minutes later the price drops even lower.

If you have ever written code for large software projects then you know that error/failure rate grows in proportion to every new line of code added. This means that the more code you write, the more mistakes/bugs/errors you induce — humans are not perfect, neither are programmers no matter how brilliant. When we write code we usually have a clear goal in mind, thus we know what the output should be given some clearly defined input. But try doing that with input date of (crypto-) assets (e.g. Bitcoin), you’ll quickly realize that it’s not a trivial thing to do. The input is never the same, and we cannot simply rely on a bunch of “if this — do that” code. However, that’s the only language a computer understands, it’s our most reliable tool for the job.

To make our job easier we have to introduce math and statistics to aid us. This is done by creating indicators, as mentioned earlier, such as SMA and MACD. An indicator is a function that takes raw data, transforms it and spits something new out. Indicators make our lives easier in analyzing the raw market data, they allow us to detect/see patterns that we wouldn’t otherwise see. If you know a tad about trading then you’ll know how useful the MACD, long & short SMA plots are.

Since indicators are solely based on historical data, they are not very good at making predictions, but neither completely useless. To illustrate this have a look at the following chart:

In the above we have the price of Bitcoin plotted out as candlesticks. Then I created an indicator (EMA) of some size, based of the Open and Close values. The next step would be to use code, that is “if this — do that” statements, to come up with a basic strategy that would have a positive ROI (incorporating trading fees). Remember my story of peaks & valleys earlier? Well here we see it in action. My algorithm uses the EMA indicator to generate a first buy signal (indicated by ‘B’ on the chart), in this case it’s designed to anticipate a valley, because after rain usually comes sunshine. However, after this buy no sun came, but only more rain (the price kept going down). And then later a sell signal is generated at some local peak.

On this chart we can clearly see how a simple algorithm makes decisions. The sad part is that its decisions are not optimal, the reason is apparent since at each step, the algorithm can only look at the past — it cannot foretell the future to make “the best” decision. The other sad part is that such a simple algorithm has hard-coded parameters, such as the indicator’s parameters (e.g. EMA size). To us humans, we may look at these generated signals and say “what a stupid decision” — we cannot argue at all, a machine can only make the decisions we program it to do. If the parameters are sub-optimal, so is the outcome.

At the lower part of the image I indicate the ROI of the generated signals, which is slightly over 94%, meaning the system made a loss of about 6% — reason: it bought higher than it sold, and to make matters worse it paid a 0.1% fee on every trade.

Below is another scenario. This time we use three indicators, but only one is shown. The brown bars represent the MACD indicator, which is built up from two different EMA indicators:

The basic idea behind using MACD as a trading strategy, is to yet again, detect peaks & valleys. This becomes apparent when you look at where the buy/sell signals appear, i.e. at some local MACD’s peak/valley. This strategy appears to work better than the previous one, since the ROI is over 99% (i.e. it still made a loss of about 1%) — but we may not compare the two just like that because the previous example only had two trading signals and this one has way more.

Here’s a third and final example. Below is a chart with signals based on the EMA signal (just like in our first case). But this time I have chosen a different region and more data.

The ROI in the above is over 99%, just like in the case of our MACD. A closer look at the signals is pretty fascinating, it made two short buy/sell trades, which are only a few minutes apart. In this case I tried to let the system make a profit in a very short amount of time by leveraging the high volatility.

Verifying a strategy

These three examples were based on a very little data. Everything shown on the chart is what was used to generate the trading signals and compute the ROI. In practice, we have to run our algorithm on days, weeks, months or even years worth of data to verify its success rate. Unfortunately very few of these strategies proved to be successful in our tests. Actually, all of the above can generate ROI of over 100% — if there were no trading fees, since that is how exchanges prevent us from becoming millionaires over night.

Deep data

Using raw market data (e.g. price & volume), combined with a few indicators will rarely yield a profitable trading strategy. Unless your algorithm is pretty sophisticated and well-designed — if so re-check everything because you may have a bug or unaccounted for scenario after all.

After a long time and countless attempts I did manage to come up with a few profitable trading algorithms. These were achieved by utilizing some default indicators which I had to adjust in several ways prior to applying. But more importantly, it’s the type of data I use which makes these methods successful. They are no longer based on just the price and/or volume, but take other factors into consideration such as sentiments (from our sentiment analyses).

On the chart above you can see one of these strategies in action. Pay close attention to where it generates the Buy/Sell signals — they appear to be way more optimal than in the previous three examples, but far from perfect. An even more important remark is the ROI, it’s positive (116%) and comes down to an average increase of 1% per day.

Below is another chart, which uses the exact same strategy but at a different date/time. Yet again the results are phenomenal with an ROI of 125%, and a daily increase of about 3%.

This strategy I continue to test, validate and optimize to this day — it appears to be the best I’ve been able to come up with thus far. Notice how far apart the buy and sell signals are compared to our previous examples. Here they are many hours or even several days apart, while previously it was just minutes or a few hours. Before I sell my kidney and go all-in, I need to make sure it will really work. Hopefully within a few weeks I’ll plug it in and write a new post showing my results.

Artificial Intelligence

One may develop as many models as he/she wishes to, or at least, as many he/she has the guts to. It’s not a straightforward development process. It’s pretty hard to test and verify new hypotheses while simultaneously tweaking its many parameters and trying different values. But what if we can use A.I. to help us come up with even better models and strategies?

This research area is even more complex than creating manual models, but it’s already the future — and if you’re not with it, you’re behind. At some point, if not already, A.I. will make better trading decisions (both long- & short-term) than humans do. If you are an A.I. researcher/developer, you know exactly how complex it is to make a system come up with a trading strategy. It’s already complex enough to train it for carrying out simple tasks such as recognizing objects in pictures.

Hybrid modelling

Until some weeks ago I was using a manually designed strategy which used our predictions to generate trading signals. The results were pretty “okay” at that time, but they did not incorporate trading fees — so they actually are “not so okay” anymore.

One of these strategies was to look at the highest and lowest prediction points and generate a sell/buy signal at these respective intervals. When you think about it, it’s definitely not going to be an optimal strategy by far. One may just as well start buying/selling randomly and have more luck through that. But I wouldn’t throw this idea in the garbage just yet. In my previous post I’ve illustrated our recent improvements of the predictions. So it’s definitely worth a shot re-running this old strategy on some new data.

What comes to mind is that these strategies are short-term, meaning they use whatever its predicted to make a decision in the moment. Since our predictions are usually no more than 3 to 15 minutes into the future, they will need to generate large enough margins to pay off the trading fees and thus generate a positive ROI.

There is one specific thing that I did add into the mix: predicting the next buy/sell signal. I wanted to teach our system to predict worthwhile buy/sell signals, the same way we make it predict the future price. To do this I first had to train the system on what a good buy and good sell position looks like. I did this by going over all the data, and using a look-ahead concept: given a position ‘t’, if in the near-future the price is going to drop a lot then ‘t’ should indicate a ‘sell’ signal — but if the price is going to go up then ‘t’ should instead be a ‘buy’. The near-future was defined by the next 10 or-so intervals. Every other ‘t’ would indicate a ‘do nothing’ signal. Below is a chart that shows the result of my efforts:

On the chart above, we see the price predictions on the blue/black candlestick graph. The purple line is the result of my signal-prediction ‘thing’ as just explained. I have taught the system that a ‘sell’ signal equals the value of 100 ; a buy equals 50 and ‘do nothing’ equals to zero. In the end the system does not fully respect my values, so it generates something either close to zero, around 50 or close to 100. But it does have some interesting results. If you look closely, it does indicate some interesting ‘buy’ signals (= value 50), and then you have to make a guess for which near-future interval the ‘sell’ signal corresponds with some ‘buy’ signal. In the end this may prove a useful trading strategy, but it’s really hard to interpret and understand as is. I need to do a lot more research and development to improve it.

Q-learning

This is a form of reinforcement learning (RL), a technique for training A.I. to do certain stuff. RL is the trick behind teaching a machine how to play chess, Go and even Space Invaders.

If we can teach a machine how to play Pac Man, then we might as well teach it to generate buy/sell signals. This is exactly what I did. The logic behind it is to give the system a reward (like a pat on the back) when it makes a good decisions, and if not we punish it. The reward/punishment is expressed as a number, and so we train the system to optimize itself for obtaining the highest possible score. At start it makes a bunch of random guesses, gets its reward/punishment and then learns/re-trains itself based on these results. This process continues indefinitely, until we tell it to stop. We don’t want the system to be over-fitted with respect to the input data, but we also don’t want it to be super random — so we have to make it stop after a certain amount of iterations.

The implementation of Q-learning was done by using an open-source project. I only had to tweak & adjust it according to my needs. I also realized that the backbone of Q-learning systems was usually based on a feed forward neural network, I haven’t stumbled upon any solution that used a recurrent neural network instead (yet). But this is what I did implement. I quickly learned that training a system through RL is a very tedious and long process, it takes literally hours/days to complete, depending on the number of iterations and some other factors.

After many painful hours of training I did see some nice buy/sell signals appear, however to me they were more like random than intelligent. Overall Q-learning did not prove to be a successful technique, but I am going to improve it because I believe it’s a gold nugget.

Thank you for reading and stay tuned for the next episode! :)
- Ilya Nevolin