Learn how you can become a Python programmer in just 12 weeks.

    We respect your privacy. Unsubscribe at anytime.

    Monte Carlo Simulation to Optimize a Portfolio using Pandas and NumPy

    What will we cover?

    In this tutorial we will learn about Monte Carlo Simulation. 

    First an introduction to the concept and then how to use Sharpe Ratio to find the optimal portfolio with Monte Carlo Simulation.

    The learning objective will be.

    • The principles behind Monte Carlo Simulation
    • Applying Monte Carlo Simulation using Sharpe Ratio to get the optimal portfolio
    • Create a visual Efficient Frontier based on Sharpe Ratio
    Watch lesson

    Step 1: What is Monte Carlo Simulation

    Monte Carlo Simulation is a great tool to master. It can be used to simulate risk and uncertainty that can affect the outcome of different decision options.

    Simply said, if there are too many variables affecting the outcome, then it can simulate them and find the optimal based on the values.

    Monte Carlo simulations are used to model the probability of different outcomes in a process that cannot easily be predicted due to the intervention of random variables. It is a technique used to understand the impact of risk and uncertainty in prediction and forecasting models.

    https://www.investopedia.com/terms/m/montecarlosimulation.asp

    Step 2: A simple example to demonstrate Monte Carlo Simulation

    Here we will first use it for simple example, which we can precisely calculate. This is only to get an idea of what Monte Carlo Simulations can do for us.

    The game we play.

    • You roll two dice. 
    • When you roll 7, then you gain 5 dollars.
    • If you roll anything else than 7, you lose 1 dollar.

    How can we simulate this game?

    Well, the roll of two dice can be simulated with NumPy as follows.

    import numpy as np
     
    def roll_dice():
        return np.sum(np.random.randint(1, 7, 2))
    

    Where are roll is simulated with a call to the roll_dice(). It simply uses the np.random.randint(1, 7, 2), which returns an array of length 2 with 2 integers in the range 1 to 7 (where 7 is not included, but 1 is). The np.sum(…) sums the two integers into the sum of the two simulated dice.

    Now to the Monte Carlo Simulation.

    This is simply to make a trial run and then see if it is a good game or not.

    def monte_carlo_simulation(runs=1000):
        results = np.zeros(2)
        for _ in range(runs):
            if roll_dice() == 7:
                results[0] += 1
            else:
                results[1] += 1
        return results
    

    This is done by keeping track of the how many games I win and lose.

    A run could look like this.

    monte_carlo_simulation()
    

    It could return array([176., 824.]), which would result in my win of 176*5 = 880 USD and lose of 824 USD. A total gain of 56 USD. 

    Each run will most likely give different conclusions.

    Step 3: Visualize the result of Monte Carlo Simulation Example

    A way to get a more precise picture is to make more runs. Here, we will try to record a series of runs and visualize them.

    results = np.zeros(1000)
     
    for i in range(1000):
        results[i] = monte_carlo_simulation()[0]
     
    import matplotlib.pyplot as plt
    %matplotlib notebook
     
    fig, ax = plt.subplots()
    ax.hist(results, bins=15)
    

    Resulting in this figure.

    This gives an idea of how a game of 1000 rolls returns and how volatile it is. See, if the game was less volatile, it would center around one place. 

    For these particular runs we have that results.mean()*5 gives the average return of 833.34 USD(notice, you will not get the exact same number due to the randomness involved).

    The average loss will be 1000 – results.mean() = 833.332 USD.

    This looks like a pretty even game.

    Step 4: Making the precise calculation of the example

    Can we calculate this exactly?

    Yes. The reason is, that this is a simple situation are simulating. When we have more variable (as we will have in a moment with portfolio simulation) it will not be the case.

    A nice way to visualize it is as follows.

    d1 = np.arange(1, 7)
    d2 = np.arange(1, 7)
    mat = np.add.outer(d1, d2)
    

    Where the matrix mat looks as follows.

    array([[ 2,  3,  4,  5,  6,  7],
           [ 3,  4,  5,  6,  7,  8],
           [ 4,  5,  6,  7,  8,  9],
           [ 5,  6,  7,  8,  9, 10],
           [ 6,  7,  8,  9, 10, 11],
           [ 7,  8,  9, 10, 11, 12]])
    

    The exact probability for rolling 7 is.

    mat[mat == 7].size/mat.size
    

    Where we count how many occurrences of 7 divided by the number of possibilities. This gives 0.16666666666666667 or 1/5.

    Hence, it seems to be a fair game with no advantage. This is the same we concluded with the Monte Carlo Simulation.

    Step 5: Using Monte Carlo Simulation for Portfolio Optimization

    Now we have some understanding of Monte Carlo Simulation, we are ready to use it for portfolio optimization.

    To do that, we need to read some time series of historic stock prices. See this tutorial to learn more on that.

    import pandas_datareader as pdr
    import datetime as dt
    import pandas as pd
     
    tickers = ['AAPL', 'MSFT', 'TWTR', 'IBM']
    start = dt.datetime(2020, 1, 1)
     
    data = pdr.get_data_yahoo(tickers, start)
    data = data['Adj Close']
    

    To use it with Sharpe Ratio, we will calculate the log returns.

    log_returns = np.log(data/data.shift())
    

    The Monte Carlo Simulations can be done as follows.

    # Monte Carlo Simulation
    n = 5000
     
    weights = np.zeros((n, 4))
    exp_rtns = np.zeros(n)
    exp_vols = np.zeros(n)
    sharpe_ratios = np.zeros(n)
     
    for i in range(n):
        weight = np.random.random(4)
        weight /= weight.sum()
        weights[i] = weight
         
        exp_rtns[i] = np.sum(log_returns.mean()*weight)*252
        exp_vols[i] = np.sqrt(np.dot(weight.T, np.dot(log_returns.cov()*252, weight)))
        sharpe_ratios[i] = exp_rtns[i] / exp_vols[i]
    

    The code will run 5000 experiments. We will keep all the data from each run. The weights of the portfolios (weights), the expected return (exp_rtns), the expected volatility (exp_vols) and the Sharpe Ratio (sharpe_ratios).

    Then we iterate over the range.

    First we create a random portfolio in weight (notice it will have the sum 1). Then we calculate the expected annual return. The expected volatility is calculated a bit different than we learned in the lesson about Sharpe Ratio. This is only to make it perform faster.

    Finally, the Sharpe Ratio is calculated.

    In this specific run (you might get different values) we get that the maximum Sharpe Ratio is, given by sharpe_ratios.max(), 1.1398396630767385.

    To get the optimal weight (portfolio), call weights[sharpe_ratios.argmax()]. In this specific run, array([4.57478167e-01, 6.75247425e-02, 4.74612301e-01, 3.84789577e-04]). This concludes to hold 45.7% to AAPL, 6.7% to MSFT, 47.5% to TWTR, and 0,03% to IBM is optimal.

    Step 6: Visualizing the Monte Carlo Simulation of the Efficient Frontier

    This can be visualized as follows in an Efficient Frontier.

    import matplotlib.pyplot as plt
    %matplotlib notebook
     
    fig, ax = plt.subplots()
    ax.scatter(exp_vols, exp_rtns, c=sharpe_ratios)
    ax.scatter(exp_vols[sharpe_ratios.argmax()], exp_rtns[sharpe_ratios.argmax()], c='r')
    ax.set_xlabel('Expected Volatility')
    ax.set_ylabel('Expected Return')
    

    Resulting in this chart.

    Want to learn more?

    This is part of a 2.5-hour full video course in 8 parts about Risk and Return.

    In the next lesson you will learn how to Calculate Correlation between Stock Price Movements with Python.

    12% Investment Solution

    Would you like to get 12% in return of your investments?

    D. A. Carter promises and shows how his simple investment strategy will deliver that in the book The 12% Solution. The book shows how to test this statement by using backtesting.

    Did Carter find a strategy that will consistently beat the market?

    Actually, it is not that hard to use Python to validate his calculations. But we can do better than that. If you want to work smarter than traditional investors then continue to read here.

    Python for Finance: Unlock Financial Freedom and Build Your Dream Life

    Discover the key to financial freedom and secure your dream life with Python for Finance!

    Say goodbye to financial anxiety and embrace a future filled with confidence and success. If you’re tired of struggling to pay bills and longing for a life of leisure, it’s time to take action.

    Imagine breaking free from that dead-end job and opening doors to endless opportunities. With Python for Finance, you can acquire the invaluable skill of financial analysis that will revolutionize your life.

    Make informed investment decisions, unlock the secrets of business financial performance, and maximize your money like never before. Gain the knowledge sought after by companies worldwide and become an indispensable asset in today’s competitive market.

    Don’t let your dreams slip away. Master Python for Finance and pave your way to a profitable and fulfilling career. Start building the future you deserve today!

    Python for Finance a 21 hours course that teaches investing with Python.

    Learn pandas, NumPy, Matplotlib for Financial Analysis & learn how to Automate Value Investing.

    “Excellent course for anyone trying to learn coding and investing.” – Lorenzo B.

    Leave a Comment