08-27-08, 04:49 AM
|
#1
|
|
|
Historical Database: Spreads/Totals
Is anyone aware of a place where I can access a historical database of spreads and O/U totals for past sporting events? I'm really only interested in football (NFL more than college but I'll take what I can get). I'm new to this stuff and am trying to figure out how to back test a few ideas I've had floating in my head the past few days. I'm a college student and don't really have much expendable cash so cost is an issue. If anyone has any info that might be helpful I'd greatly appreciate it!
|
|
|
08-27-08, 10:07 AM
|
#2
|
|
|
Just go to covers, you can copy and paste scores and games from there into excel, have your own little database for free in no time.
And cost will not be an issue!
I think you are always better off building any database you plan to use anyway, then you know what you have.
|
|
|
08-27-08, 09:43 PM
|
#5
|
|
escarbajo negro
|
Nice job.
I wish I had that page before i spent a thousand hours and dollars collecting all that myself. 
|
|
|
08-27-08, 10:30 PM
|
#7
|
|
|
yea np it only took 8 hours or so to get every sport from covers, the ATSLines went much quicker
once you write the script for one its easy to get them all
about the quarter scores, that website i looked at it but i only know how to efficiently scrape data that is embedded in HTML
that website puts the boxscores in a <pre> tag which seems like it'll be a huge pain to figure out how to scrape it
that sunshine sites spreads are really off i noticed, i think he is using opening lines for it from the paper.
i'm working on writing a script that will just automatically update the new csv to the website everyday, but i've been lazy
the covers lines are also off a lot from the ATSLines ones. I think Covers is more accurate though, but depending on which one you're using you could get big difference in results
|
|
|
08-27-08, 10:34 PM
|
#8
|
|
escarbajo negro
|
yep, i'm learning to program now an realizing that if i'd learned this stuff a couple years ago when i was basically making my nfl database (which is basically worthless anyway since i hardly bet nfl anymore) manually i could have saved myself an inordinate amount of time (and money since i paid someone to do it for other leagues). oh well, it should help me tremendously going forward.
thanks for the scripts, they should help.
|
|
|
08-27-08, 11:24 PM
|
#9
|
|
|
Quote:
Originally Posted by rsigley
about the quarter scores, that website i looked at it but i only know how to efficiently scrape data that is embedded in HTML
that website puts the boxscores in a <pre> tag which seems like it'll be a huge pain to figure out how to scrape it
|
Seems like you need to search for 12 spaces and then grab the strings on the right after 7th and 9th successful searches.
Regardless, take a look at this site as well:
http://www.archive.org/web/web.php
|
|
|
08-28-08, 12:36 AM
|
#10
|
|
|
Good stuff rsigley. Appreciate it.
|
|
|
08-28-08, 04:10 PM
|
#11
|
|
|
Thanks rsigley. Echo the other sentiments here and very much appreciate it.
__________________
Hartford Whalers
1972-1997
Long Live the Whale
2006 Stanley Cup Champions
|
|
|
08-29-08, 05:36 AM
|
#12
|
|
A Doll's House
|
Errors in the NCAAFB ATSLines DB:
10/1/05, Mich/Mich St, score is reversed, Mich won 34-31
9/22/07, Mich/Penn St, duplicate entry
9/24/05, UNC/NC St, score is reversed, UNC won 31-24
|
|
|
08-29-08, 11:30 PM
|
#13
|
|
A Doll's House
|
9/3/05, Colorado/CSU, score should be 31-28
9/10/05, Iowa St/Iowa, score should be 23-3
|
|
|
08-30-08, 01:06 AM
|
#14
|
|
A Doll's House
|
9/25/99, E Carolina/Miami FL, score should be 27-23
11/13/05, Houston/SMU, away team was So Miss not SMU
|
|
|
08-30-08, 02:54 AM
|
#15
|
|
A Doll's House
|
12/1/07, UCF/Tulsa, score should be 44-23
8/28/99, Notre Dame/Kansas, score should be 48-13
|
|
|
08-31-08, 10:59 AM
|
#16
|
|
|
whops yea if its wrong in there its wrong on the website since i didn't manually input the information
i found a couple errors in the covers db's too in nfl, but i think the one up there is the correct version
|
|
|
09-01-08, 06:58 PM
|
#17
|
|
A Doll's House
|
12/25/00, Arizona/BC, should be Arizona St, not Arizona
|
|
|
09-01-08, 08:58 PM
|
#18
|
|
A Doll's House
|
10/6/07, Ball St/Central Mich, score should be 38-58
|
|
|
09-01-08, 09:29 PM
|
#19
|
|
A Doll's House
|
11/13/04, Florida/S Carolina, score should be 48-14
|
|
|
09-01-08, 09:51 PM
|
#20
|
|
A Doll's House
|
12/6/03, Georgia/LSU, score should be 13-34
|
|
|
09-01-08, 10:45 PM
|
#21
|
|
A Doll's House
|
10/29/05, LSU/N Texas, score should be 56-3
|
|
|
09-01-08, 11:44 PM
|
#22
|
|
|
Quote:
Originally Posted by rsigley
yea np it only took 8 hours or so to get every sport from covers, the ATSLines went much quicker
once you write the script for one its easy to get them all
about the quarter scores, that website i looked at it but i only know how to efficiently scrape data that is embedded in HTML
that website puts the boxscores in a <pre> tag which seems like it'll be a huge pain to figure out how to scrape it
that sunshine sites spreads are really off i noticed, i think he is using opening lines for it from the paper.
i'm working on writing a script that will just automatically update the new csv to the website everyday, but i've been lazy
the covers lines are also off a lot from the ATSLines ones. I think Covers is more accurate though, but depending on which one you're using you could get big difference in results
|
FYI, you should teach yourself one of the Visual Studio languages. It's pretty simple to parse HTML files and mine any tags or sets of tags (even nested ones) using the webbrowser control and regular expressions. The syntax for regular expressions is a bit cumbersome at first. However, once you learn it, it's a powerful tool for parsing. There are also utilities (like RegEx Buddy) that can help you learn and format your string searches.
|
|
|
09-02-08, 12:03 AM
|
#23
|
|
A Doll's House
|
9/17/05, New Mexico/New Mexico St, score should be 38-21
|
|
|
09-02-08, 03:12 AM
|
#24
|
|
A Doll's House
|
10/22/97, S Carolina/Kentucky, game does not exist
|
|
|
09-02-08, 03:23 AM
|
#25
|
|
A Doll's House
|
11/27/2004, S Carolina/Georgia Tech, home team should be Georgia
|
|
|
09-02-08, 03:49 AM
|
#26
|
|
A Doll's House
|
9/5/99, TCU/Arizona, score should be 31-35
|
|
|
09-02-08, 06:53 PM
|
#27
|
|
A Doll's House
|
10/23/04, UL-Lafayette/Arkansas St, score should be 27-24
|
|
|
09-02-08, 06:54 PM
|
#28
|
|
A Doll's House
|
10/18/03, UL-Lafayette/New Mexico St, score should be 26-24
|
|
|
09-02-08, 07:17 PM
|
#29
|
|
A Doll's House
|
9/10/05, Utah/Utah St, score should be 31-7
|
|
|
09-02-08, 07:30 PM
|
#30
|
|
|
nice rsigley, thanks for sharing
|
|
|
09-02-08, 10:07 PM
|
#31
|
|
A Doll's House
|
9/18/04, UL-Monroe/Arkansas, line should be +29.5
|
|
|
09-02-08, 10:10 PM
|
#32
|
|
A Doll's House
|
1/2/02, Maryland/Florida, line should be +14.5
|
|
|
09-02-08, 10:12 PM
|
#33
|
|
A Doll's House
|
10/6/01, LSU/Florida, line should be +14
|
|
|
09-02-08, 10:30 PM
|
#34
|
|
A Doll's House
|
1/2/98, Tennessee/Nebraska, line should be +13
|
|
|
09-02-08, 10:33 PM
|
#35
|
|
A Doll's House
|
12/18/01, N Texas/Colorado St, line should be +11
|
|
|
|
|