Run Environment 6: Run Values 2

The last table for our Run Environment set is almost exactly the same as the one I presented before with two differences: we’re returning the runMinus and RunPlus from the intermediate table, and we’re using a variable to define wOBAscale- making the whole thing much more elegant and easy to understand.

# The last Run Values table, returning fields from RunValues1A, defining wOBAscale
# as a variable, and giving us the weighted factors by league year for batting events
DROP TABLE IF EXISTS tblRunValues2;
CREATE TABLE tblRunValues2 AS
SELECT year
, league_id
, RperOut 
, runBB 
, runHB 
, run1B 
, run2B 
, run3B 
, runHR 
, runSB 
, runCS 
, runMinus 
, runPlus 
, wOBA 
, @ws := 1/(runPlus+runMinus) AS wOBAscale 
, (runBB+runMinus)*@ws AS wobaBB 
, (runHB+runMinus)*@ws AS wobaHB 
, (run1B+runMinus)*@ws AS woba1B 
, (run2B+runMinus)*@ws AS woba2B 
, (run3B+runMinus)*@ws AS woba3B 
, (runHR+runMinus)*@ws AS wobaHR 
, runSB*@ws AS wobaSB 
, runCS*@ws AS wobaCS 
FROM tblRunValues1A;

This is the table that we’ll refer to when calculating wOBA for individual players, factoring wobaXX against their counting stats XX.  It’ll make more sense when we actually do it.

This is almost all of the data we need to complete our batting stats.  What remains is park factor.  I had been hoping that we could use the factors listed on the parks table.  Unfortunately, OOTP’s resident MySQL Whisperer says it’s not true. I’m going to set that aside for today- while I figure out how to calculate park factors – and use an incremental approach to developing the batting stats view.  I figure that will give me one, maybe two, posts to get a good method for that.

Run Environment 5: An Intermediate Table

In addition to using variables, Wyers also does the other thing I had thought about using: an intermediate table to store the runMinus, runPlus, and wOBAScale values.  On the one hand, this makes sense as we will only have to write out the calculations once and then refer to them in this table when we need them.  On the other hand, if we managed to use variables for defining the run values in the previous table, we should be able to do so with these values as well.

Also, in this table, we define league wOBA.  It makes sense that the weighted On Base Average for the entire league is just the entire league’s On Base Percentage.  If we’re looking at an average for the entire league, there is nothing to weigh it against.  We’ll see this idea again when we look at wRC+.

A few minor adjustments to the code to allow for OOTP’s naming conventions and one final difference:  I’m taking the batting stats from the league_history_batting table rather than the players_ file.  It’s already there and there are far fewer rows to sort through.  And given that we’re not filtering out any stats, it’s silly not to use it.  And, because we’re using it, we can use the column for singles (s) rather than deriving singles from H-2b-3b-HR.

#Creating an intermediate table so as not to have to write out formulae for rumMinus etc.
DROP TABLE IF EXISTS tblRunValues1A;
CREATE TABLE IF NOT EXISTS tblRunValues1A AS
SELECT r.year
, r.league_id
, r.RperOut 
, r.runBB 
, r.runHB 
, r.run1B 
, r.run2B 
, r.run3B 
, r.runHR 
, r.runSB 
, r.runCS 
, SUM(runBB*(BB-IBB) + runHB * HP + run1B * s + run2B * d 
   + run3B * t + 1.4 * HR + runSB * SB - runCS * CS) 
   / SUM(AB - H + SF) AS runMinus 

, SUM(runBB * (BB-IBB) + runHB * HP + run1B * s + run2B * d 
   + run3B * t + 1.4 * HR + runSB * SB - runCS * CS) 
   / SUM(BB-IBB + HP + H) AS runPlus

, SUM(H+BB-IBB+HP) / SUM(AB+BB-IBB+HP+SF) AS wOBA 
 
FROM tblRunValues r
INNER JOIN league_history_batting_stats 
   ON r.year=league_history_batting_stats.year 
   AND r.league_id=league_history_batting_stats.league_id

 
GROUP BY 
r.year
, r.league_id
, r.RperOut 
, r.runBB 
, r.runHB 
, r.run1B 
, r.run2B 
, r.run3B 
, r.runHR 
, r.runSB 
, r.runCS 
 
ORDER BY 
r.year DESC;

Run Environment 4: Run Values A Different Way

My attempt in the last post resulted in a view of linear weights that, while it seemed to work, was very unwieldy.  It was so because for each of the linear weights, I had to spell out fully the calculations for runPlus, runMinus, and wobaScale.

I’ve been aware of this code from Colin Wyers for a while now, but wanted to try my hand at translating Tango’s views into MySQL on my own.  I’ve done that.  Now, I’m going to try to pick apart how Wyers uses variables to simplify the process and configure this back to my OOTP tables.  Probably best to have that link open in another tab while I walk through this.

I’ll be breaking this across several posts.

We’re going to ignore the first statement that creates a primary position (PrimPos) table.  As mentioned before, this table would serve to eliminate stats created by non-pitchers pitching.  My feeling is that they are part of the run environment and should be included and, besides, this is a rarer event in OOTP than it is in real life.

The second thing to note is that Wyers is creating a table rather than a view.  This is actually a good idea.  Views save disk space but have slower performance than tables.  That’s because views are essentially stored queries that are run every time they are accessed.  Tables are created once and saved as a data object in their own right.  Since space is not a concern for me and all of the advanced stats will require complex queries, I think that opting for performance here is the right thing to do.

Next, I am going to leave in place my view vLeagueRunsPerOut – the foundational view/table.  They’re too similar for me to make any changes.  Besides, I like that I thought to total up league Plate Appearances in mine.  It comes in handy later.

Minor changes to the RunValues table give me this:

DROP TABLE IF EXISTS tblRunValues;
CREATE TABLE IF NOT EXISTS tblRunValues
AS SELECT year
, league_id
, RperOut 
, @rb := RperOut+0.14 AS runBB 
, @rb+0.025 AS runHB 
, @rs := @rb+0.155 AS run1B 
, @rd := @rs+0.3 AS run2B 
, @rd+0.27 AS run3B 
, 1.4 AS runHR 
, 0.2 AS runSB 
, 2*RperOut+0.075 AS runCS 
FROM vLeagueRunsPerOut;

 

Two things are happening here that I didn’t think we could do.  First, declaring variables with just the ‘@’ and the ‘:=’ operator.  Murach, whose book I’ve been using to learn MySQL, doesn’t mention this as an option.  He does usually point out shortcuts that some devs take before advising against using them yourself.  This squares with the other web searches I’ve done regarding variables in MySQL.  But, it seems to work, so…

Second, I could have sworn that I had read that you can’t rely on how MySQL will evaluate variables when used like this.  That is, there’s not a guarantee that it will evaluate ‘@rb’ first and use that value to evaluate ‘@rs’.  It could decide to do ‘@rs’ first and force ‘@rb’ to null.  That didn’t happen here, and I am a little confused. (NOTE: I emailed Wyers about this and will update if he gets back to me.)

Bottom line, though, is that this worked.  Comparing the results from my original view without variables to Wyers’ table with them shows identical results if we disregard all of the trailing 0’s:

Oirignal View created without variables
Results from modified RunValues table with variables

OK then.  On to the next one.

Run Environment 3: The Second Run Values View

Here’s what we’re doing for the next view.  We’re taking all of the run expectancy values from the previous view, making 3 more calculations that I will explain in a moment, and creating linear weights for each non-out batting event that we’ll use to derive wOBA.

runMinus establishes a run value for outs and other events that are not part of OBP.  It’s AB – H + SF, and is equivalent to a batting out.

runPlus gives us the average run value for all non-out batting events: Hits, Hit-by-Pitches, and Walks.

wOBA will give us the Weighted On Base Average for the league

wobaScale will give us the factor to apply to all of the other linear weights and for each league year in order to scale players’ wOBAs to their matching, unweighted On Base Percentage.

Two notes before the code.  First, we’re using stats from the players_career_batting_stats table.  I wrote a join to the league_history_stats table when I was thinking of doing it another way.  I may go back and switch it out to the other table as it will save us doing the sums.

Second, there’s got to be a more elegant way to do this.  I think there is, and it’s to do with defining runPlus, runMinus, and wobaScale as variables so that I don’t have to elaborate the code each time they are mentioned.  I know that I can’t use column aliases within the select statement.  So my options are leaving it as is (I am getting results that look reasonable), try my luck with variables, or create another table or view to store these values and call them from this view.

I will come back to this to try to clean it up.

 

CREATE OR REPLACE VIEW vRunValues2 AS
SELECT 
r.year
, r.league_id
, r.RperOut 
, r.runBB 
, r.runHB 
, r.run1B 
, r.run2B 
, r.run3B 
, r.runHR 
, r.runSB 
, r.runCS 
, Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr)
    + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS)
    / Sum(b.ab -b.h + b.SF) AS runMinus 
, Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B * (b.H -b.d -b.t -b.hr)
    + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr + r.runSB * b.SB - r.runCS * b.CS)
    / Sum(b.BB - b.IBB + b.HP +b.H) AS runPlus 

, #1/([runPlus]+[runMinus]) AS wOBAscale 
1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B * (b.H -b.d -b.t -b.hr)
    + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr + r.runSB * b.SB - r.runCS * b.CS))
    / (Sum(b.BB - b.IBB + b.HP +b.H) + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP
    + r.run1B * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR
    + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF))) AS wOBAscale

, #([runBB]+[runMinus])*[wOBAscale] AS wobaBB 
(r.runBB + (Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B
    * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB
    * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF))) 
    * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B
    * (b.H -b.d -b.t -b.hr) + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr
    + r.runSB * b.SB - r.runCS * b.CS)) / (Sum(b.BB - b.IBB + b.HP +b.H)
    + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B 
    * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
    + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF)))) AS wobaBB

, #([runHB]+[runMinus])*[wOBAscale] AS wobaHB 
(r.runHB +(Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B 
   * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
   + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF))) 
   * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B * (b.H -b.d -b.t -b.hr) 
   + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr + r.runSB * b.SB - r.runCS * b.CS)) 
   / (Sum(b.BB - b.IBB + b.HP +b.H) + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP 
   + r.run1B * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
   + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF)))) AS wobaHB

, #([run1B]+[runMinus])*[wOBAscale] AS woba1B 
(r.run1b + (Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF))) * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP 
   +r.run1B * (b.H -b.d -b.t -b.hr) + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr 
   + r.runSB * b.SB - r.runCS * b.CS)) / (Sum(b.BB - b.IBB + b.HP +b.H) 
   + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF)))) AS woba1B

, #([run2B]+[runMinus])*[wOBAscale] AS woba2B 
(r.run2b + (Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF))) * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP 
   +r.run1B * (b.H -b.d -b.t -b.hr) + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr 
   + r.runSB * b.SB - r.runCS * b.CS)) / (Sum(b.BB - b.IBB + b.HP +b.H) 
   + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP 
   + r.run1B * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
   + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF)))) AS woba2B

, #([run3B]+[runMinus])*[wOBAscale] AS woba3B 
(r.run3B + (Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF))) * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP 
   +r.run1B * (b.H -b.d -b.t -b.hr) + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr 
   + r.runSB * b.SB - r.runCS * b.CS)) / (Sum(b.BB - b.IBB + b.HP +b.H) 
   + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF)))) AS woba3B

, #([runHR]+[runMinus])*[wOBAscale] AS wobaHR 
(r.runHR + (Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF))) * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP 
   +r.run1B * (b.H -b.d -b.t -b.hr) + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr 
   + r.runSB * b.SB - r.runCS * b.CS)) / (Sum(b.BB - b.IBB + b.HP +b.H) 
   + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP + r.run1B * (b.H -b.d - b.t -b.hr) 
   + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR + r.runSB * b.SB - r.runCS *b.CS) 
   / Sum(b.ab -b.h + b.SF)))) AS wobaHR

, #[runSB]*[wOBAscale] AS wobaSB 
r.runSB * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B * (b.H -b.d -b.t -b.hr) 
   + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr + r.runSB * b.SB - r.runCS * b.CS)) 
   / (Sum(b.BB - b.IBB + b.HP +b.H) + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP 
   + r.run1B * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
   + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF)))) AS wobaSB

, #[runCS]*[wOBAscale] AS wobaCS
r.runCS * (1/ ((Sum(r.runBB * (b.BB - b.ibb) + r.runHB * b.HP +r.run1B * (b.H -b.d -b.t -b.hr) 
   + r.run2B *b.d + r.run3B * b.t + 1.4 *b.hr + r.runSB * b.SB - r.runCS * b.CS)) 
   / (Sum(b.BB - b.IBB + b.HP +b.H) + Sum(r.runBB* (b.bb - b.ibb) + r.runHB * b.HP 
   + r.run1B * (b.H -b.d - b.t -b.hr) + r.run2B * b.d + r.run3B * b.t + 1.4*b.HR 
   + r.runSB * b.SB - r.runCS *b.CS) / Sum(b.ab -b.h + b.SF)))) AS wobaCS

FROM 
vRunValues r
INNER JOIN players_career_batting_stats b ON r.year=b.year AND r.league_id=b.league_id
INNER JOIN league_history_batting_stats lhb ON r.year=lhb.year AND r.league_id=lhb.league_id

GROUP BY 
r.year
, r.league_id
, r.RperOut 
, r.runBB 
, r.runHB 
, r.run1B 
, r.run2B 
, r.run3B 
, r.runHR 
, r.runSB 
, r.runCS


ORDER BY 
r.year DESC;

Run Environment 1: League Stats

Here’s where things start to get fun. I’m going to be setting up a number of views that will allow me to calculate some of the more advanced statistics. The ones that I am most interested in are Weighted On-Base Average (wOBA), Weighted Runs Created (wRC), Weighted Runs Above Average (wRAA), and Weighted Runs Created+ (wRC+). If this blog ever becomes a book or major motion picture, I will come back and fill in some detail about what these stats are and why they are meaningful. Until then, I will leave the links to do the explaining.

I am basing all of this Run Environment work on a couple of posts and formulas posted by Tom Tango on his blog.

wOBA is the gatekeeper to all of these other stats, and it can’t be derived from a player’s individual stats alone.  It requires some baseline statistics about the run environment in which a batter plays.  The first step is to determine, for each league year, the number of Runs, Outs, Plate Appearances, Runs per Out, and Runs per PA.  I don’t have a good way, yet, to separate out sub_leagues, and Tango doesn’t seem to care at this point, so I leave it alone.

The other thing that Tango does and that I have decided to omit is limiting the summed pitching stats to those accumulated by players whose primary position is pitcher.  My experience is that non-pitchers pitch so rarely in OOTP that it is not worth creating a new view to determine primary position.

I create a view, as such:

CREATE OR REPLACE VIEW vLeagueRunsPerOut AS
SELECT p.year
, p.league_id
, SUM(p.r)/sum(p.outs) AS "RperOut"
, sum(p.r) AS "totR"
, sum(p.outs) AS "totOuts"
, sum(p.outs)+sum(p.ha)+sum(p.bb)+ sum(p.iw)+ sum(p.sh)
   + sum(p.sf) AS "totPA"
, round(sum(p.r)/(sum(p.outs)+sum(p.ha)+sum(p.bb)+ sum(p.iw)+ sum(p.sh)
   + sum(p.sf)),8) AS "RperPA"
FROM players_career_pitching_stats AS p
GROUP BY p.year, p.league_id;

Tables 2: Leagues, Sub_Leagues, and Divisions

One interesting feature in each of these tables is the ‘gender’ column.  Leaving aside the question of why leagues, subleagues, or divisions would need to be gendered (at least in English), it does beg the question of whether, perhaps OOTP will ever consider mixed or even female-only leagues.

Creating the leagues table

The leagues table has a LOT of columns.  Part of me thinks that I should figure out a way to truncate the data before loading it into the database because there’s just sooo many!  And I really only need the first several.  On the other hand, I would have to this every time I generated a data dump.  There will never be so many leagues that this table will become too large, so I will leave it alone for now.  Note that we are only interested in the first 4 columns:

CREATE TABLE `leagues` (
  `league_id` int(11) NOT NULL,
  `name` varchar(50) DEFAULT NULL,
  `abbr` varchar(50) DEFAULT NULL,
  `nation_id` int(11) DEFAULT NULL,
  `language_id` int(11) DEFAULT NULL,
  `gender` int(11) DEFAULT NULL,
  `historical_league` tinyint(4) DEFAULT NULL,
  `logo_file_name` varchar(200) DEFAULT NULL,
  `players_path` varchar(200) DEFAULT NULL,
  `start_date` date DEFAULT NULL,
  `preferred_start_date` date DEFAULT NULL,
  `pitcher_award_name` varchar(50) DEFAULT NULL,
  `mvp_award_name` varchar(50) DEFAULT NULL,
  `rookie_award_name` varchar(50) DEFAULT NULL,
  `defense_award_name` varchar(50) DEFAULT NULL,
  `fictional_players` tinyint(4) DEFAULT NULL,
  `start_fantasy_draft` tinyint(4) DEFAULT NULL,
  `trading_deadline` tinyint(4) DEFAULT NULL,
  `winter_meetings` tinyint(4) DEFAULT NULL,
  `arbitration_offering` tinyint(4) DEFAULT NULL,
  `show_draft_pool` tinyint(4) DEFAULT NULL,
  `rosters_expanded` tinyint(4) DEFAULT NULL,
  `draft_date` date DEFAULT NULL,
  `rule_5_draft_date` date DEFAULT NULL,
  `roster_expand_date` date DEFAULT NULL,
  `trade_deadline_date` date DEFAULT NULL,
  `allstar_date` date DEFAULT NULL,
  `days_until_deadline` int(11) DEFAULT NULL,
  `next_draft_type` int(11) DEFAULT NULL,
  `parent_league_id` int(11) DEFAULT NULL,
  `league_state` smallint(6) DEFAULT NULL,
  `season_year` int(11) DEFAULT NULL,
  `historical_year` smallint(6) DEFAULT NULL,
  `league_level` smallint(6) DEFAULT NULL,
  `stats_detail` int(11) DEFAULT NULL,
  `historical_import_path` varchar(200) DEFAULT NULL,
  `foreigner_percentage` smallint(6) DEFAULT NULL,
  `was_ootp6` tinyint(4) DEFAULT NULL,
  `was_65` tinyint(4) DEFAULT NULL,
  `allstar_game` tinyint(4) DEFAULT NULL,
  `auto_schedule_allstar` tinyint(4) DEFAULT NULL,
  `allstar_team_id0` int(11) DEFAULT NULL,
  `allstar_team_id1` int(11) DEFAULT NULL,
  `schedule_file_1` varchar(200) DEFAULT NULL,
  `schedule_file_2` varchar(200) DEFAULT NULL,
  `rules_rule_5` tinyint(4) DEFAULT NULL,
  `rules_minor_league_options` tinyint(4) DEFAULT NULL,
  `rules_trading` tinyint(4) DEFAULT NULL,
  `rules_draft_pick_trading` tinyint(4) DEFAULT NULL,
  `rules_financials` tinyint(4) DEFAULT NULL,
  `rules_amateur_draft` tinyint(4) DEFAULT NULL,
  `rules_fa_compensation` tinyint(4) DEFAULT NULL,
  `rules_schedule_balanced` tinyint(4) DEFAULT NULL,
  `rules_schedule_inter_league` tinyint(4) DEFAULT NULL,
  `rules_schedule_force_start_day` tinyint(4) DEFAULT NULL,
  `rules_trades_other_leagues` tinyint(4) DEFAULT NULL,
  `rules_free_agents_from_other_leagues` tinyint(4) DEFAULT NULL,
  `rules_free_agents_leave_other_leagues` tinyint(4) DEFAULT NULL,
  `rules_allstar_game` tinyint(4) DEFAULT NULL,
  `rules_spring_training` tinyint(4) DEFAULT NULL,
  `rules_active_roster_limit` smallint(6) DEFAULT NULL,
  `rules_secondary_roster_limit` smallint(6) DEFAULT NULL,
  `rules_expanded_roster_limit` smallint(6) DEFAULT NULL,
  `rules_min_service_days` smallint(6) DEFAULT NULL,
  `rules_waiver_period_length` smallint(6) DEFAULT NULL,
  `rules_dfa_period_length` smallint(6) DEFAULT NULL,
  `rules_fa_minimum_years` smallint(6) DEFAULT NULL,
  `rules_salary_arbitration_minimum_years` smallint(6) DEFAULT NULL,
  `rules_minor_league_fa_minimum_years` smallint(6) DEFAULT NULL,
  `rules_foreigner_limit` smallint(6) DEFAULT NULL,
  `rules_foreigner_pitcher_limit` smallint(6) DEFAULT NULL,
  `rules_foreigner_hitter_limit` smallint(6) DEFAULT NULL,
  `rules_schedule_games_per_team` smallint(6) DEFAULT NULL,
  `rules_schedule_typical_series` smallint(6) DEFAULT NULL,
  `rules_schedule_preferred_start_day` smallint(6) DEFAULT NULL,
  `rules_amateur_draft_rounds` smallint(6) DEFAULT NULL,
  `rules_minimum_salary` int(11) DEFAULT NULL,
  `rules_salary_cap` int(11) DEFAULT NULL,
  `rules_player_salary0` int(11) DEFAULT NULL,
  `rules_player_salary1` int(11) DEFAULT NULL,
  `rules_player_salary2` int(11) DEFAULT NULL,
  `rules_player_salary3` int(11) DEFAULT NULL,
  `rules_player_salary4` int(11) DEFAULT NULL,
  `rules_player_salary5` int(11) DEFAULT NULL,
  `rules_player_salary6` int(11) DEFAULT NULL,
  `rules_player_salary7` int(11) DEFAULT NULL,
  `rules_average_coach_salary` int(11) DEFAULT NULL,
  `rules_average_attendance` int(11) DEFAULT NULL,
  `rules_average_national_media_contract` int(11) DEFAULT NULL,
  `rules_cash_maximum` int(11) DEFAULT NULL,
  `rules_average_ticket_price` double DEFAULT NULL,
  `rules_revenue_sharing` tinyint(4) DEFAULT NULL,
  `rules_national_media_contract_fixed` tinyint(4) DEFAULT NULL,
  `rules_owner_decides_budget` tinyint(4) DEFAULT NULL,
  `rules_schedule_auto_adjust_dates` tinyint(4) DEFAULT NULL,
  `rules_historical_import_rookies` tinyint(4) DEFAULT NULL,
  `avg_rating_contact` int(11) DEFAULT NULL,
  `avg_rating_gap` int(11) DEFAULT NULL,
  `avg_rating_power` int(11) DEFAULT NULL,
  `avg_rating_eye` int(11) DEFAULT NULL,
  `avg_rating_strikeouts` int(11) DEFAULT NULL,
  `avg_rating_stuff` int(11) DEFAULT NULL,
  `avg_rating_movement` int(11) DEFAULT NULL,
  `avg_rating_control` int(11) DEFAULT NULL,
  `avg_rating_fielding0` int(11) DEFAULT NULL,
  `avg_rating_fielding1` int(11) DEFAULT NULL,
  `avg_rating_fielding2` int(11) DEFAULT NULL,
  `avg_rating_fielding3` int(11) DEFAULT NULL,
  `avg_rating_fielding4` int(11) DEFAULT NULL,
  `avg_rating_fielding5` int(11) DEFAULT NULL,
  `avg_rating_fielding6` int(11) DEFAULT NULL,
  `avg_rating_fielding7` int(11) DEFAULT NULL,
  `avg_rating_fielding8` int(11) DEFAULT NULL,
  `avg_rating_fielding9` int(11) DEFAULT NULL,
  `avg_rating_overall` int(11) DEFAULT NULL,
  `avg_rating_age` double DEFAULT NULL,
  `league_totals_ab` int(11) DEFAULT NULL,
  `league_totals_h` int(11) DEFAULT NULL,
  `league_totals_d` int(11) DEFAULT NULL,
  `league_totals_t` int(11) DEFAULT NULL,
  `league_totals_hr` int(11) DEFAULT NULL,
  `league_totals_bb` int(11) DEFAULT NULL,
  `league_totals_hp` int(11) DEFAULT NULL,
  `league_totals_k` int(11) DEFAULT NULL,
  `league_totals_pa` int(11) DEFAULT NULL,
  `league_totals_babip` double DEFAULT NULL,
  `league_totals_mod_h` double DEFAULT NULL,
  `league_totals_mod_d` double DEFAULT NULL,
  `league_totals_mod_t` double DEFAULT NULL,
  `league_totals_mod_hr` double DEFAULT NULL,
  `league_totals_mod_bb` double DEFAULT NULL,
  `league_totals_mod_hp` double DEFAULT NULL,
  `league_totals_mod_k` double DEFAULT NULL,
  `league_totals_mod_babip` double DEFAULT NULL,
  `ml_equivalencies_avg` double DEFAULT NULL,
  `ml_equivalencies_hr` double DEFAULT NULL,
  `ml_equivalencies_eb` double DEFAULT NULL,
  `ml_equivalencies_bb` double DEFAULT NULL,
  `ml_equivalencies_k` double DEFAULT NULL,
  `ml_equivalencies_hp` double DEFAULT NULL,
  `player_creation_modifier_contact` double DEFAULT NULL,
  `player_creation_modifier_gap` double DEFAULT NULL,
  `player_creation_modifier_power` double DEFAULT NULL,
  `player_creation_modifier_eye` double DEFAULT NULL,
  `player_creation_modifier_strikeouts` double DEFAULT NULL,
  `player_creation_modifier_stuff` double DEFAULT NULL,
  `player_creation_modifier_movement` double DEFAULT NULL,
  `player_creation_modifier_control` double DEFAULT NULL,
  `player_creation_modifier_speed` double DEFAULT NULL,
  `player_creation_modifier_fielding` double DEFAULT NULL,
  `financial_coefficient` double DEFAULT NULL,
  `world_start_year` int(11) DEFAULT NULL,
  `current_date` date DEFAULT NULL,
  `background_color_id` varchar(8) DEFAULT NULL,
  `text_color_id` varchar(8) DEFAULT NULL,
  `scouting_coach_id` int(11) DEFAULT NULL,
  PRIMARY KEY (`league_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

 

Creating the sub_leagues table

The sub_leagues table is much more manageable.  It serves to name each league’s sub_leagues and attach them to a parent league.  Note here that I made a composite PK rather than adding an AUTO_INCREMENT:

CREATE TABLE `sub_leagues` (
  `league_id` int(11) NOT NULL,
  `sub_league_id` int(11) NOT NULL,
  `name` varchar(50) DEFAULT NULL,
  `abbr` varchar(50) DEFAULT NULL,
  `gender` int(11) DEFAULT NULL,
  `designated_hitter` tinyint(4) DEFAULT NULL,
  PRIMARY KEY (`league_id`,`sub_league_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;

 

Creating the divisions table

This one, too, is very straightforward.  Again, a composite primary key rather than an AUTO_INCREMENT:

CREATE TABLE `divisions` (
  `league_id` int(11) NOT NULL,
  `sub_league_id` int(11) NOT NULL,
  `division_id` int(11) NOT NULL,
  `name` varchar(50) DEFAULT NULL,
  `gender` int(11) DEFAULT NULL,
  PRIMARY KEY (`league_id`,`sub_league_id`,`division_id`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;