Historical Minor League Statistics
About the project
As part of SABR's mission to preserve and disseminate materials relating to the history of baseball, the Minor Leagues Committee has undertaken a community project to document the statistical history of professional baseball. The core focus of the project is to compile statistics for each league-season, using the best information available. Each season's statistics are critically examined before publication, and known errors and omissions from Guides and other sources are corrected.
Data coverage and sources
The project uses the league-season (one full season of one league) as the basic unit of statistical compilation. Leagues are scheduled for compilation in reverse chronological order. The current focus of new input and evaluation is the 1989 season.
Once the statistics are in electronic form, we review them for errors, both in transcription and in balance. We check whether each team's statistics equal the sum of its players' statistics, and whether totals such as runs and hits balance between batters and pitchers. This process ensures data quality and often catches errors in published totals. It is also labor-intensive, so it takes a while for a season to achieve quality certification in the database. We appreciate your patience in allowing us to bring you a quality resource.
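The team-level part of this check can be sketched in a few lines of Python. This is only an illustration, not the project's actual tooling: the record layout, the field names (team, AB, R, H), the function name team_imbalances, and the sample figures are all assumptions made up for the example.

```python
from collections import defaultdict

def team_imbalances(player_rows, team_totals, categories=("AB", "R", "H")):
    """Return {team: {category: (sum_of_players, published_total)}} for every mismatch."""
    # Add up each player's line under the team he played for.
    sums = defaultdict(lambda: dict.fromkeys(categories, 0))
    for row in player_rows:
        for cat in categories:
            sums[row["team"]][cat] += row[cat]

    # Compare the summed player lines with the published team totals.
    problems = {}
    for team, published in team_totals.items():
        diffs = {
            cat: (sums[team][cat], published[cat])
            for cat in categories
            if sums[team][cat] != published[cat]
        }
        if diffs:
            problems[team] = diffs
    return problems

# Made-up figures for two hypothetical teams (not real statistics).
players = [
    {"team": "Team A", "AB": 400, "R": 60, "H": 110},
    {"team": "Team A", "AB": 350, "R": 45, "H": 90},
    {"team": "Team B", "AB": 420, "R": 70, "H": 125},
]
totals = {
    "Team A": {"AB": 750, "R": 105, "H": 201},
    "Team B": {"AB": 420, "R": 70, "H": 125},
}

print(team_imbalances(players, totals))
```

In this made-up data, Team A's players sum to 200 hits against a published total of 201, so that one category is flagged. Whether the error lies in a player line or in the published total still has to be settled by going back to the sources.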
We use the best information available in compiling statistics for a league. Official league statistics and tabulations published in major guides are used for most leagues. We also build on research done by SABR members and others in correcting and extending those publications.
Quality control: The proofing and balancing process
The list of leagues with statistics that have passed the proofing process is here. These leagues are grouped into two categories. Leagues listed as proofed have been reviewed for inconsistent or suspicious entries in individual player statistics. In addition, their statistics pass internal consistency checks: individual player statistics sum to the team totals, and totals for batters equal totals for pitchers in categories such as hits and runs, which must by definition be the same. Few league statistics balance perfectly, even in the era of computer tabulation; leagues in this category have at most minor balance errors.
Leagues listed as reviewed have been checked only for inconsistent or suspicious entries in individual player statistics. Leagues appear in this category when some limitation prevents a complete set of consistency checks. Examples include a lack of published team statistical totals, or the absence of "less thans" (players appearing in a small number of games) from the official statistics. We present statistics for leagues in this category when we believe the statistics are the best available. Since the balancing process often uncovers subtle errors in a league's statistics, statistics for leagues in this category should be viewed as possibly less reliable than those listed as proofed.
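The batter-versus-pitcher balance can be illustrated with a short Python sketch. It rests on several assumptions: the row layout, the use of the same keys (R, H) for runs and hits scored by batters and allowed by pitchers, the function names, and the tolerance parameter are all hypothetical. In particular, the project does not publish a numeric threshold for what counts as a "minor" balance error.

```python
def league_balance(batting_rows, pitching_rows, categories=("R", "H")):
    """Return {category: batting_total - pitching_total} for one league-season."""
    return {
        cat: sum(r[cat] for r in batting_rows) - sum(r[cat] for r in pitching_rows)
        for cat in categories
    }

def is_balanced(differences, tolerance=0):
    """True if every category differs by no more than the (hypothetical) tolerance."""
    return all(abs(d) <= tolerance for d in differences.values())

# Made-up figures (not real statistics): runs and hits by batters ...
batting = [
    {"R": 60, "H": 110},
    {"R": 45, "H": 90},
    {"R": 70, "H": 125},
]
# ... and runs and hits allowed by pitchers in the same league-season.
pitching = [
    {"R": 100, "H": 160},
    {"R": 75, "H": 163},
]

diff = league_balance(batting, pitching)
print(diff)
print(is_balanced(diff), is_balanced(diff, tolerance=2))
```

Here runs balance exactly, while the batters are credited with two more hits than the pitchers allowed. Under a strict test the league fails; with a tolerance of two it would count as having only a minor balance error.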
Information from other sources
For leagues whose full statistics have not yet been compiled and vetted in electronic format, we offer selected statistics for players based on a database compiled by Ed Washuta and donated to SABR in 2007. Due to the sheer size of the task, we regret that changes to the Washuta data, including the correction of statistical errors and the addition of statistics for unlisted players, will be made only in cases of extreme and egregious errors.
Is the database available in (MySQL, CSV, etc.) format?
The statistical history of minor league baseball is very poorly documented. We view most of the statistics we currently display as being provisional, and we anticipate this will be the case for some time. We believe it is unwise to release downloadable datasets which are immature and have not been cross-checked for quality. It is our plan to offer downloads of a full year's worth of statistics (for all leagues) once all leagues in that year have completed the proofing process.
How can I help?
The development of the database is powered entirely by volunteer effort. With a goal of providing statistics for over 4,000 league-seasons, volunteers are needed to compile, cross-check, verify, and fill in gaps in statistics. Much of the work involved is data entry and validation. If you're a SABR member, you can inquire about volunteering by starting a discussion in the SABR Minor Leagues Committee group on SABRNation.