View New Posts
  1. #1

    Default Scraping

    I'v read a lot on here about people "scraping" data from a website and converting it into excel (i think?).

    Now, i have like 0 knowledge of programming, however i am good at excel, so i am curious how do-able this is, and how hard it would be for me to learn to do this.

    I would like to take a website such as http://sports.yahoo.com/nhl/players/...y7eJzqAEdivLYF and convert all those stats into excel.

    Now i realize i can just highlight the data, copy and paste it, and clean it up, showing me the columns i'd like to see. I guess i can just keep copy and pasting values, knowing which columns theyll be in, and set up equations to only extract data from those columns.

    My question is...is there a faster way to do this? Is there a way i can just simply change the player # in the yahoo sports game log link, and have excel automatically extra data the way i want it to? I guess i just want to cut down on taking the time to copy and paste the table and cleaning it up, was hoping there was some kind of code to do that for me?

  2. #2

    Default

    Try the sticky at the top of the Think Tank...

    Intro to research

  3. #3

    Default

    Quote Originally Posted by Full Time Hobo View Post
    Try the sticky at the top of the Think Tank... Intro to research
    just follow ssk, the ats atm

  4. #4

    Default

    Pay someone to do it.

    Fastest way. You ramping up to learn how will take ages.

    OR... Look online for some application that takes an html table, and convert it to csv. That will same ALOT of time and headaches. But still require some time to ramp up, to understand how to use whichever application you can find

  5. #5

    Default

    Quote Originally Posted by Maverick22 View Post
    Pay someone to do it.

    Fastest way. You ramping up to learn how will take ages.

    OR... Look online for some application that takes an html table, and convert it to csv. That will same ALOT of time and headaches. But still require some time to ramp up, to understand how to use whichever application you can find
    For someone with 0 knowledge, I'd go w/ Maverick's first option. The time lost + high probability of mashing your data in a dangerous way (ex: flipping td and ints or something) will be much more expensive than paying someone.
    7,375

    SBR POKER TOP 100

    60th Place 11/1/2011

    iPad
    WINNER
    SBR Store 09/13/2011


  6. #6

    Default

    WOuld someone on here be willing to take payment via Paypal for something like this or are you recommending we go find a local to take care of this for us? I wouldn't even know where to begin to look for someone to something like this, except here.

  7. #7

    Default

    Quote Originally Posted by Cap dat 4ss View Post
    WOuld someone on here be willing to take payment via Paypal for something like this or are you recommending we go find a local to take care of this for us? I wouldn't even know where to begin to look for someone to something like this, except here.
    Well, if you PM on paying someone for anything here at SBR, the mods will pull your PM privileges, so this isn't a decent place to advert either.

    Oh BTW, if you read between the lines: PMs aren't P either.

  8. #8

    Default

    Quote Originally Posted by Wrecktangle View Post
    Oh BTW, if you read between the lines: PMs aren't P either.
    Were the fuk SBR get this idea from. Just like the US government they think they got the right to read your PM's. Complete bullshit.
    225pts

    SBR POKER TOURNEY4th Place 5/23/2012


  9. #9

    Default

    I'd have a look at excel's 'Import External Data' first. It would be easy to try:

    Go to page that you want in a browser (or have the url handy).
    In Excel to to the menu Item 'Data -> Import External Data -> New Web Query' and Select.

    A window will appear - highlight the url at the top and paste in your url. Hit the options button and disable date recognition. Press the 'Go' button and the site that you are interested in will appear.

    Decide what data you want and click the yellow marks to select - the marks will turn into green checks.

    Hit 'Import' and boom - the page will appear in excel all separated into columns. Save as a .csv file ....

    LT


    Quote Originally Posted by Cicima6709 View Post
    I'v read a lot on here about people "scraping" data from a website and converting it into excel (i think?).

    Now, i have like 0 knowledge of programming, however i am good at excel, so i am curious how do-able this is, and how hard it would be for me to learn to do this.

    I would like to take a website such as http://sports.yahoo.com/nhl/players/...y7eJzqAEdivLYF and convert all those stats into excel.

    Now i realize i can just highlight the data, copy and paste it, and clean it up, showing me the columns i'd like to see. I guess i can just keep copy and pasting values, knowing which columns theyll be in, and set up equations to only extract data from those columns.

    My question is...is there a faster way to do this? Is there a way i can just simply change the player # in the yahoo sports game log link, and have excel automatically extra data the way i want it to? I guess i just want to cut down on taking the time to copy and paste the table and cleaning it up, was hoping there was some kind of code to do that for me?

  10. #10
    Flight's Avatar SBR PRO
    Join Date: 01-27-09
    Posts: 1,979
    SBR Points: 12711
    Message Me

    Default

    A page like the one you linked sometimes copies well into Excel straight up. Try all browsers - IE, Firefox, Chrome. Some will copy paste to excel better than others for certain tables.

    For ESPN pages I copy paste straight to Excel. I can analyze player props within 5 minutes with this method. The only thing that a scraper would do would be to bring that number down under a minute. This would be nice, but I still have yet to do it... I just stick with the manual approach when it comes to player props. But if you need to update constantly (like NHL or MLB), then yah you need a sweet program to get data from the internet.

  11. #11

    Default ncaa boxscores

    well i code scrapers almost exclusively. i am attaching 'scraped' boxscores for the the first 4 weeks of ncaa season. if have any questions or need explanation of columns send me a pm. as always these are free to use and distribute. change the extension to .csv and pull them into Excel. Enjoy
    Attached Files

  12. #12

    Default

    here are co0lumn headings for above files:
    1st downs,3rd dwn eff (2col), 4th dwn eff, tot yds, ps yds, com/att (2col),yds/ps,rsh yds, rsh att,yds/rsh,penal (2col),fumbles lost,ints, top (secs)

  13. #13

    Default

    Quote Originally Posted by Wrecktangle View Post
    Well, if you PM on paying someone for anything here at SBR, the mods will pull your PM privileges, so this isn't a decent place to advert either.

    Oh BTW, if you read between the lines: PMs aren't P either.
    Really? I made an offer to write scrapers for people on this forum and no one from SBR said anything. Can I get paid in points?

  14. #14

    Default week 5 boxes

    a couple rows (A&M & nor ill) need a bit of edit
    Attached Files

  15. #15

    Default

    Quote Originally Posted by podonne View Post
    Really? I made an offer to write scrapers for people on this forum and no one from SBR said anything. Can I get paid in points?
    Keep it up, they'll get you eventually.

    But if you'd rather, I can turn you in...your choice.

  16. #16

    Default

    Quote Originally Posted by Wrecktangle View Post
    Keep it up, they'll get you eventually.

    But if you'd rather, I can turn you in...your choice.
    Not trying to do anything wrong. I posted my own thread and no admins\mods commented there either. I never saw anywhere where it said you couldn't offer other your services to other handicappers on here. Just trying to make a few bucks during unemployment, and since I write so many scrapers myself, seemed natural.

    I'll contact SBR directly as ask what their policy is.

  17. #17

    Default

    Quote Originally Posted by Wrecktangle View Post
    Keep it up, they'll get you eventually.

    But if you'd rather, I can turn you in...your choice.
    Whoever said you were a douchebag? They were OBVIOUSLY wrong.

  18. #18

    Default

    Quote Originally Posted by podonne View Post
    Not trying to do anything wrong. I posted my own thread and no admins\mods commented there either. I never saw anywhere where it said you couldn't offer other your services to other handicappers on here. Just trying to make a few bucks during unemployment, and since I write so many scrapers myself, seemed natural.

    I'll contact SBR directly as ask what their policy is.
    I'd let him tattle on you instead.

  19. #19

    Default

    Quote Originally Posted by MonkeyF0cker View Post
    Whoever said you were a douchebag? They were OBVIOUSLY wrong.
    Monkey droppings, my one stab at your sarcastic games...

    ...no where near your level, obviously.

  20. #20

    Default

    The world needs more douche bags. Noone likes smelly vaginas.

  21. #21

  22. #22

  23. #23

    Default

    Google Excel SMF Add-In...may save time from learning to code or buy you time while learning

Top