|
April 1998 Volume 8 Number 4 Forecasting Transportation Expense: A Shipper's Perspective Regression analysis provides an understanding into the relationship between actions and outcomes, and how it all affects transportation costs.
By Jeff Zoroya
A major retailer is scheduling delivery to hundreds of stores. Should the deliveries be made in the morning, the afternoon or the evening? What is the cost associated with each course of action? A major consumer products manufacturer is locating a new distribution center to support its growing business in the Southeast. One alternative location is Jacksonville, another is Orlando. Although the costs appear equal, are they? What are the transportation costs associated with the two locations?
A common obstacle shippers face to controlling transportation expense is not fully understanding its underlying cost drivers. By setting service requirements and locating facilities, shippers have enormous influence over their transportation expense. However, they are often unable to quantify that influence. What follows is a way shippers can measure the cost drivers of their transportation expense using their own data, an Excel spreadsheet and a basic understanding of regression analysis.
Why use regression?
What should we examine?
Necessary Data and Origin-to-Destination Cost
30 observations will not increase the accuracy of our analysis, but simply turn it into a fancy weighted average. Fortunately for us, our data set meets both of these criteria, we have 43 distinct lanes that represent truckload movement throughout Europe.
In our first pass, we want to determine what influence the distance from the origin to the destination has on our truckload rates. Following the steps outlined in the help section of Excel, and as shown in Figure 1, we: 1) set the Y range equal to the price of our lanes, 2) set the X range equal to our lane distances, 3) set our confidence level at 95 percent, 4) indicate that our model includes column labels (Cost and Km), 5) force the intercept (constant) to zero, and 6) ask that our residuals be displayed in the output. Remember, we are forcing the intercept to zero, which means the regression will estimate the cost of a lane based on its distance only in effect, a cost per kilometer.
Before continuing with this example, it's important to consider what to look for in the output. First, the "Adjusted R Squared" measures the cost variability, by lane, that our regression can predict. The more variability we can predict, the better. Second, the "t statistic" measures how important each of our inputs (known as independent variables) are in predicting our lane costs. As long as an independent variable's "t statistic" falls outside a certain range of values, we know it contributes to the accuracy of our forecast. This range changes slightly with the size of the problem, but generally with 30 or more degrees of freedom, a "t statistic" of greater than 1.7 or less then -1.7 means that particular independent variable contributes to the accuracy of our model (degrees of freedom are the number of observations less the number of independent variables in the regression, and are indicated in the output as "Residual"). If an independent variable does not contribute to our regression's accuracy, we may want to exclude it.
Looking at our output (Figure 2), we notice several things. First, our model accounts for 59 percent of the variability in our lane rates. Second, our independent variable lane distance contributes to the accuracy of our model with a "t statistic" of 18.8. Finally, our model estimates our cost per kilometer to be $1.40.
Setup Cost and Effect of Origin-to-Destination Cost
After running this second regression (Figure 3), we see that the accuracy of our model has improved. We can now account for 64 percent of the variability in our lane rates. Both of our independent variables are significant in influencing our model. The "t statistic" of the intercept (setup cost) is 1.8, and the "t statistic" of the cost per kilometer is 8.7. The cost of getting the carrier's equipment to our origin is $287 per move. Our cost per kilometer, from origin to destination, is $1.19. It's not unusual to see the values of the independent variables change as you increase the level of detail in your model. As you can see, the cost per kilometer has already decreased from $1.40 to $1.19 because part of the original cost includes the setup cost of $287.
In our next pass, we want to measure the cost of traversing water along our lanes. We do this by creating a dummy variable, in the form of a column (labeled water) in which we will enter a value of either "0" or "1" for each lane. If a lane contains any movement via water, we would indicate this by placing a "1" in the appropriate cell; if not, we would place a "0" in the cell. This water column will now be included in the "X" range of our regression analysis. Additionally, we will make our first exclusion by removing the Ferrania-to-Birmingham lane. This observation appears to be an outlier with some property that we are unable to identify.
After running this regression (Figure 4) we notice that the "t statistic" of our dummy variable "water" indicates that it is not significant. However, our regression can now account for 67 percent of the variability in our lane rates. Although we may end up discarding "water" later, we will retain it for now because it did improve our ability to forecast our lane rates. Now that we have isolated the additional cost of traversing water to $218, our cost per kilometer has decreased again to $1.14. Additionally, we begin to see a pattern in the lanes that are the more difficult to predict (Figure 5). Three of these lanes all have destinations in the former East Bloc. We appear to be under-forecasting the cost of shipping to these destinations.
Next we will measure the impact of shipping to eastern Europe by again creating a dummy variable column. If the destination of a lane lies within eastern Europe, we will indicate this by placing a "1" in the appropriate cell. If not, we will place a "0" in the cell.
As you can see in the regression output (Figure 6), the accuracy of our model has increased. We can now account for 92 percent of the variability of the price of a lane. Additionally, the "t statistics" of our independent variables (setup cost, water, eastern Europe, and cost per kilometer) indicate that they all contribute to the accuracy of our model. Finally, we can now see that there is a cost penalty of $1,899 for shipping to eastern Europe. The reason for this is probably due to the number of days the carrier spends clearing the border and the inability of the carrier to find a back haul out of eastern Europe.
In our last regression we will look at the impact of an improved back haul opportunity on our lane cost. We can create an additional dummy variable ("Low Cost Zone") and include the areas where we appear to be over-forecasting the cost of a lane. An examination of our last output shows that we are over-forecasting cost when the destination is Paris, northern Italy, the Low Countries/Ruhr industrial region. In all three regions, carriers probably have an improved chance of finding follow-on business at the conclusion of our move. So we should reduce our lane rates to reflect those opportunities.
Once we run this regression (Figure 7), we can see that our accuracy has increased slightly to 93 percent, and that all the independent variables, including our newly created low cost zone, are still significant. Additionally, we can see that there is an incentive of $262 to ship into these three "Low Cost Zone" destinations. Now, using the coefficients from our model, we can make lane-to-lane cost comparisons by multiplying the same distance by our forecasted cost drivers.
Additional opportunities
in the appropriate place on the April Reader Service Form Copyright © 2020 by APICS The Educational Society for Resource Management. All rights reserved. All rights reserved. Lionheart Publishing, Inc. 2555 Cumberland Parkway, Suite 299, Atlanta, GA 30339 USA Phone: +44 23 8110 3411 | E-mail: Web: www.lionheartpub.com Web Design by Premier Web Designs E-mail: [email protected] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||