Top
« « The Seasonality Problem (Part Six) | The Seasonality Problem (Part Eight) » »

Comments

6 Responses to “The Seasonality Problem (Part Seven)”

  1. chupitero on December 3rd, 2007 8:27 am

    Isn’t the chitest non-parametric test used to determine the relationship between two nominal variables? I don’t see either data being nominal or any variable violating the normality assumption so wouldn’t it be more appropriate to use a parametric test which are viewed to pack a lot more statistical muscle? Also, wouldn’t a significance level lower than what was expected only lead to the conclusion that there is a relationship or for hypothesis testing that there was a difference between what was expected and what was observed?

  2. MTM on December 3rd, 2007 9:49 am

    Thanks for your feedback, chupitero. Actually a parametric test (e.g. t-test, etc.) may be performed, but the problems lie in the experiment design:

    1. there are no two populations that can be compared against each other
    2. we don’t have a proper assumption on the expected mean of the distribution. arguably for price data, the variance of the distribution could be infinite, so we can’t predict an “average price”

    As a workaround, chitest was used but by transforming the months into categories, and converting the days in the month as frequencies–in effect converting the experiment into nominal data. This avoids the need to presume a mean for the distribution (i.e the actual price values), while dwelling on the actual occurrences which satisfy the criteria.

    The chitest in excel computes the alpha level that the test is significant at: and at such a low level it does suggest that there is a difference between the expected and actual distribution–which disproves the null hypothesis that Seasonality has no effect on returns.

    I do agree that parametric tests are more powerful, and I appreciate any suggestions on how we can adjust the experiment to accomodate such a test.

  3. chupitero on December 4th, 2007 7:58 pm

    Isn’t the second population the win-loss ratio? If we have the data for each 180 day trial then we can compute the mean can’t we? Personally though I would prefer using %gain for each 180 day period instead of a win-loss ratio. As a guide I think we can use the methodology found in various studies on the january effect in the US stockmarket.

  4. MTM on December 4th, 2007 8:16 pm

    In the experiment, there’s only one population of 1,767 180-day trials which result in either a Win or a Loss. We then categorize them by month to see if the month (i.e. seasonality) has an effect on the distribution of wins and losses.

    Are you suggesting we take the winning trades as one population, and the losing trades as another population, and then check for differences in their means? We already know this, since the winning trades will have a positive return, and the losing trades will have a negative one. This won’t help us in isolating the effect of the month.

    As an aside, another way I thought earlier was to treat each monthly batch of trades as a separate population, then check for differences in the means. But your standard parametric tests won’t accomodate more than two populations–since we’ll have 12 sets of trades to check with each other. So maybe another method (like ANOVA) might do the trick?

  5. chupitero on December 5th, 2007 7:43 pm

    My mistake, there’s only one population. I still think we should use % gain instead of a win-loss ratio though but sample size needs to be increased from just 7 years. I don’t see why we can’t use the more common t statistic instead. The standard p value of .05 for significance tests still leaves a 5% probability that the result was due to chance though but I think the most important thing is how we explain the results. How do we explain the market anomaly? Is the market just egregiously inefficient or is there some other hypothesis?

  6. MTM on December 6th, 2007 12:18 am

    I agree that the % gain should also be tested for significance. My current dilemma is how to redesign the experiment to be able to use other tests to isolate the effect of the month-of-the-year variable. The Chi-test only tests for frequencies, which is in itself, incomplete.

    Meanwhile, as to the more illuminating question: “Why?” This study opens a pandora’s box of other possible things:

    1. A cyclical component. Which is in itself an anomaly. And agree that this should be extended further in the past. In the next run I plan to take it at least 10, maybe 20 years to see if the dynamic persists. If it doesn’t then it begs another question: why does it work from 2000?

    2. Seasonality is tricky since we are likely capturing some other behaviour that just happens to fall in the proper time. This is either an effect on prices in October or an effect on prices in 180 days hence March/April, or both.

    3. For companies: culprits I’d like to check is corporate fiscal years, taxation periods, quarterly earnings. The activity around these items are usually September and April–so may contribute to something.

    4. Just out of the blue: Fund rebalancing activity:
    a. The Ghost Months phenomenon which is associated with low fund activity occurs usually from May to September–which is incidentally the anti-thesis of the ideal period indicated by the study which is October to April. The data shows poor performance of purchases made in January to March–the 180 day sell periods of these months would fall within the Ghost Months.

    b. For the same reason, the “window dressing” myth which occurs around December to January could also account for the strong finish of our ideal period.

    5. Macroeconomic cycles, weather patterns are also good “oddball” candidates to check.

    The problem as always, with any of the above, is the availability of data to test against.

Feel free to leave a comment...
and oh, if you want a pic to show with your comment, go get a gravatar!





Bottom