Poster's Sportsbook Poll: OctoberView Poll Results
1. 5Dimes 450 total points 5Dimes Review
2. Pinnacle 408 total points Pinnacle Review
3. Heritage 227 total points Heritage Review
4. Bookmaker 138 total points Bookmaker Review
5. BetIslands 129 total points BetIslands Review
SBR Top-Rated Sportsbooks Recommended List
1. Pinnacle Sports SBR Rating A+ Pinnacle Sports Review
2. 5Dimes SBR Rating A+ 5Dimes Review
3. BookMaker SBR Rating A+ BookMaker Review
4. Legends SBR Rating A+ Legends Review
5. Bodog SBR Rating A Bodog Review
 
 
View New Posts
 
LinkBack Thread Tools
Old 11-10-09, 07:29 PM   #1
Dark Horse
Deus Ex Machina
 
Dark Horse's Avatar
SBR PRO
Joined: 12-14-05
Posts: 12,817
 
Message Me
Default pdf to data file?

Is there a way to transfer pdf to data files?

This is for horses. The cards I use are in pdf. But the only way to effectively analyze all that data is with a computer program. I asked the programmer about this and he responded:

Quote:
You're the first person to ask, and I'm afraid I haven't looked at the Equibase pp files to see if the program can be adapted to use them.
Equibase owns TrackMaster now, but I doubt they'd use the TrackMaster file format, because it is in dBase III format, which is a little ancient (nothing wrong with TrackMaster data it's just that the format is dated).

I was curious, so I just looked at the Equibase site, and the $2 "Premium" PPs seems to be a .pdf file (an image of the PPs, but no data you can get at). I couldn't find any actual data file associated with it.
Give Points Quick reply to this message

SBR Founder Join Date: 12/14/2005

Old 11-10-09, 07:53 PM   #2
rk9
 
rk9's Avatar
Joined: 08-24-09
Posts: 117
 
Message Me
Default

You can just copy paste pdf docs.
highlight the document then copy (ctrl c) then paste (ctrl v) it in an excel doc or a word doc. then it will be editable for your liking.
Give Points Quick reply to this message
Old 11-11-09, 12:29 AM   #3
Wrecktangle
 
Wrecktangle's Avatar
Joined: 03-01-09
Posts: 1,492
 
Message Me
Default

yeah, I scrape 'em all the time, not sure what the issue is...
Give Points Quick reply to this message
Old 11-11-09, 06:12 AM   #4
Dark Horse
Deus Ex Machina
 
Dark Horse's Avatar
SBR PRO
Joined: 12-14-05
Posts: 12,817
 
Message Me
Default

I don't want to get them to excel. This is a computer program for horse racing that doesn't 'read' the pdf formats.

I'm wondering if this is the direction in which to look: http://www.simx.com/simx/Products.stp?prm=tc
Give Points Quick reply to this message

SBR Founder Join Date: 12/14/2005

Old 11-11-09, 12:10 PM   #5
Dave Head
 
Dave Head's Avatar
Joined: 07-22-09
Posts: 73
 
Message Me
Default

Hi rk9 and Wrecktangle,

Where are you getting your PPs from? The ones that I have found do not have text that you can copy and paste. The pdf presents the information as an image. Thanks.
Give Points Quick reply to this message
Old 11-12-09, 10:00 AM   #6
Wrecktangle
 
Wrecktangle's Avatar
Joined: 03-01-09
Posts: 1,492
 
Message Me
Default

...um, the pdfs I've worked with had a text box you can puck and then darken the page with your cursor to scrape...I wonder if they've turned it off on your application?
Give Points Quick reply to this message
Old 11-12-09, 11:41 AM   #7
Dave Head
 
Dave Head's Avatar
Joined: 07-22-09
Posts: 73
 
Message Me
Default

Hi Wrecktangle

My application is the Adobe Acrobat reader. Here is a link to one of the PDF files:

http://www.drf.com/data/samples/sample_basic_pps.pdf

You can view it by clicking on this link, but if you download it, then open it with Adobe reader, you'll see in the title bar of the window: (SECURED). it won't let you select or copy anything.

All of the free pdfs that I have found refuse to let you select or copy anything. Not all say that they are (SECURED).

I'm too cheap to pay for past performances, and I'm too cheap to get a copy of Adobe writer to see if that would make any difference.

So, let me repeat the question. Where are you getting your past performances pdfs from?

Last edited by Dave Head; 11-12-09 at 11:52 AM. Reason: formatting
Give Points Quick reply to this message
Old 11-12-09, 09:50 PM   #8
Wrecktangle
 
Wrecktangle's Avatar
Joined: 03-01-09
Posts: 1,492
 
Message Me
Default

Yep, locked up, sorry.

...you can't make money on ponies anyway...vig is too high...do baskets: we're whacking 'em
Give Points Quick reply to this message
Old 11-12-09, 11:51 PM   #9
rk9
 
rk9's Avatar
Joined: 08-24-09
Posts: 117
 
Message Me
Default

Sorry Dave, I usually get my info from other sources than pdf webpages for stats and so on. Most docs that ive seen that are pdf can be copied. If not you can usually save them as text and get around it that way. Like wrecktangle said this one seems to be pretty locked up.

Dark Horse- I was just using excel as an example. you should be able to copy paste most pdfs to a lot of different type of files. Im not exactly sure what you mean by data file, that seems kind of vague. that software program looks interesting. If those outputs are the format you want the files in, then that looks like a decent but somewhat expensive option.
Give Points Quick reply to this message
Old 11-13-09, 06:00 AM   #10
Dark Horse
Deus Ex Machina
 
Dark Horse's Avatar
SBR PRO
Joined: 12-14-05
Posts: 12,817
 
Message Me
Default

Quote:
Originally Posted by rk9 View Post
Dark Horse- I was just using excel as an example. you should be able to copy paste most pdfs to a lot of different type of files. Im not exactly sure what you mean by data file, that seems kind of vague. that software program looks interesting. If those outputs are the format you want the files in, then that looks like a decent but somewhat expensive option.
I asked the programmer. It's too complex for that solution too:

Quote:
I'm afraid that won't help, since the data needs to be in a very exact format. Every data supplier has their own, but you can picture it like a big spread sheet. BRIS, for example, has over 1400 columns and each column holds one piece of data about that past performance. Every past performance of every horse is one row, 1400+ cells wide. (So if there are 10 horses in a race and each has 10 past performances, that is 100 rows for one race, and if there are 10 or 12 races, that makes a "spreadsheet" 1400+ cells wide X 1200+ rows in height.

As I said, each data supplier has their own data standard, and for example, column 128 might be Trainer's First Name, and it always must contain text, not a number, etc., so it would be close to impossible to convert it in any way that would be exact enough to use.
Give Points Quick reply to this message

SBR Founder Join Date: 12/14/2005

Old 11-13-09, 11:42 AM   #11
TJMAXX
 
TJMAXX's Avatar
Joined: 05-22-09
Posts: 19
 
Message Me
Default

nevermind...

Last edited by TJMAXX; 11-13-09 at 11:50 AM.
Give Points Quick reply to this message
 


Thread Tools



All times are GMT -5. The time now is 05:39 PM.


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41