Sacred Data

If you want to work with any data on the T-Rank site, please get in touch with me—I'm happy to share and most of it is available in bulk on the site without the need to scrape.

For example, much of the data is available at the site in .csv and .json files in the format of XXXX_team_results.csv (or .json) where XXXX = the year. So, for example, http://barttorvik.com/2019_team_results.csv gives final stats from last season. These files update constantly during the season.

For player stats, see the first comment below. Also, please search the comments below because I’ve answered many questions in them over the years. 

Sometimes I notice mass scraping operations that are detrimental to site performance, and I take efforts to block those. If that happens to you and your aims were not malicious, let me know.

126 comments:

  1. Hi. I wanted to pull player stats from 2009 to 2016 for a school project. Is there any way could help me get the csv files for each year?

    Thanks

    ReplyDelete
    Replies
    1. csvs for player stats are available on my site at getadvstats.php?year=2009&csv=1 (change the year for other years)

      The column header info is available here: https://www.dropbox.com/s/ryugeykvntto5ji/pstatheaders.xlsx?dl=0

      Delete
  2. Hey I am attempting to pull lineup/player efficiency numbers but cannot find a reliable boxscore api feed with subsitutions. Can you share where you are pulling your data?

    ReplyDelete
    Replies
    1. I use a variety of sources. I've a paid subscription to the feed at natstat.com and also fill in gaps from stats.ncaa.org if necessary. But I don't parse play-by-play for subs (on/off) so not exactly sure if this will help you.

      Delete
  3. Hi Bart, just want to say thanks very much for all your data. Your work is really engaging, and it has been a big hit for us over at No Bid Nation (the only William & Mary-focused basketball blog). I am hoping to put together a model to track the CAA this year, and I will be sure to give you credit!

    ReplyDelete
  4. Hello,
    Do you have a returning production data point? I am happy to compile it myself from a csv file if the compiled data points are available.

    Sincerely,
    Kevin

    ReplyDelete
    Replies
    1. I typically calculate "returning possession minutes" for preseason projections https://www.barttorvik.com/rpms.php

      Delete
  5. Checkout the bigballR R package! Even if you aren't familiar/fluent in R programming the package has functions that will enable you to download/calculate play-by-play/stats (including lineup and on/off stats) and save data it as a csv with only a couple lines of code. Checkout the package's github page (https://github.com/jflancer/bigballR) that includes a handful of examples that should be a big help.

    ReplyDelete
  6. Hi! Is there an easy way to access "Today's Games" with each matchup and its predicted winner, spread, and probability? I'm looking to pull games from 2012-2019.

    ReplyDelete
    Replies
    1. This information is available at YEAR_results.csv - but it only goes back to 2015.

      Delete
    2. Bart- this is a great site so kudos to you and the rest of the crew for compiling this information. I downloaded the YEAR_results.csv files and cannot figure out what the last two columns represent. Can you tell me or point to column headers file? Thanks!

      Delete
    3. I believe the last two columns are pregame "Torvik Thrill Quotient" and pregame projected tempo.

      Delete
  7. Hello, big fan of your content. I run a sports betting YouTube channel a major focus point is a monte carlo simulation model I use. I have used a scraper for ncaa.org for years, but with the mass cancellations this year, its been a bit of a pain, but I've been able to work around it. However, there are still some games missing data, such as Eastern Illinois-UW Green Bay from December 5: https://stats.ncaa.org/contests/1983012/box_score

    I've only found that game and UTEP-St. Mary's that have returned a "Box Score Not Found". It's only 2 games, but still, it bothers me. So I am interested in your thoughts about NatStat as you said you subscribe. I don't need play by play data, just box score data. Is it worth it for just that? Or should I just let go the very small percentage of games on ncaa.org that have no data and not worry about them.

    Thanks, William

    ReplyDelete
  8. Bart: I just found out about your stats website. My bad! I am IndyStar's Butler beat writer and am surprised to see Aaron Thompson 19th in player rankings. It has long been evident how valuable he is, but somehow you have quantified that. If you don't mind, please send short explanation: david.woods@indystar.com.

    ReplyDelete
  9. Hi Bart! This data is awesome! I'm doing data analysis on home field advantages during COVID, but it looks like there is a slight problem with the first few columns of 2021_results.csv. It looks like it is combining both teams and the date into a single column, so the first game of the 2021 season looks like this: McNeese St.Nebraska11-25. Do you have an easy fix for that?

    ReplyDelete
    Replies
    1. Hi Carver. That is intentional, as that field is what I use as a unique gameID. There is a file at YEAR_super_sked.csv that has more information.

      Delete
    2. Hi Bart,
      Do you have a guide to what the columns are in YEAR_super_sked.csv?

      Delete
    3. Not really - best I can do is this;

      muid, date, conmatch, matchup, prediction, ttq, conf, venue, #0 - 7
      team1, t1oe, t1de, t1py, t1wp, t1propt, team2, t2oe, t2de, t2py, t2wp, # 8 - 18
      t2propt, tpro, t1qual, t2qual, gp, result, tempo, possessions, t1pts, #19 - 27
      t2pts, winner, loser, t1adjt, t2adjt, t1adjo, t1adjd, t2adjo, t2adjd, #28 - 36
      gamevalue, mismatch, blowout, t1elite, t2elite, ord_date, t1ppp, t2ppp, gameppp, #37-45
      t1rk, t2rk, t1gs, t2gs, gamestats, overtimes, t1fun, t2fun, results] #46-54

      Delete
  10. Hey Bart! do you have a .csv file for all team stats?

    ReplyDelete
  11. Hi Bart! Is there any way to download pre-tournament team statistics from the last few years?

    ReplyDelete
    Replies
    1. Couple ways to do this.

      1) You can use the T-Rank Time Machine (https://barttorvik.com/trank-time-machine.php) to get the actual ratings on the day after Selection Sunday. Those data files are available at /timemachine/team_results/YYYYMMDD_team_results.json.gz(compressed json files)

      2) You can filter the main page to just pre-tournament games by selecting only Regular Season games in the "type" drop down. This doesn't give the exact pre-tourney adjusted efficiency because it doesn't account for the recency bias that the actual ratings use. You can accomplish the same thing by setting the date ranges to end at Selection Sunday.

      This data can be pulled at, e.g. teamslicejson.php?year=2019&json=1&type=R (for 2019). Change "json=1" to "csv=1" for a csv. (I leave it as a fun project for your to figure out the columns.)

      Delete
    2. Hey Bart - Is there any way to get data for 2009 and 2010 using step 1 above? Looks like data only populates back to 2010. Thanks!

      Delete
    3. Sorry, at this point the time machine only goes back to 2011 season.

      Delete
    4. Hey Bart,

      First of all, thanks for all of your work. The website and data are really cool and it's clear you're passionate about this stuff!

      I'm trying to pull pre-tournament data for the teams for the 2021 season and tried your method listed above. However, I am noticing that it isn't taking into account conference tournament games if you set the date any earlier than May 1 2021. For example, the data through May 1 says Abilene Christian's record is 24-5, but if you move it back a day to April 30, 2021 it removes (presumably) the conference tournament data while keeping the NCAA tournament data (ACU's record drops to 20-5, which incudes their 1-1 record in the NCAA tournament but does not include their 4-0 conference tournament record). It appears that the other statistics are affected as well (the Barthag and other stats change if you go back a day).

      Thanks!
      Ryan

      Delete
    5. Hi Ryan - thanks for the kind words. Sorry didn't see this comment till now -- but I believe we corresponded on Twitter about this. In case anyone else runs into this issue, what was actually happening by chancing the date to April 30th was that it took the system off the "real" ratings that incorporate recency (this happens whenever you customize anything) and the records were different because the non-D1 games don't count for the ratings are only included in the win-loss record when the "real" ratings are displayed just so that the record matches the official ncaa win loss records.

      Delete
  12. Hey Bart, is it possible to get the T-Ranketology Now data in json format?

    ReplyDelete
    Replies
    1. There is a file at now_inprob.json

      Delete
    2. thank you, is there a way to include the seed or to sort it by the seed?

      Delete
    3. the "score" is in there (the sixth element for each team) so if you can manipulate the data in your programming language of choice it should be trivial to sort by that.

      Delete
    4. thanks! indeed it does appear that sorting on the sixth element for each team manipulates the data into the correct order for almost all of the 1-12 seeds.

      maybe you can help me further, as i am trying to build a visual representation of the T-Ranketology Now bracket. i can sort on the score element to get most of the 1-12 seeded teams. however, it seems natural that a lot of the First Teams Out have higher scores than the teams that would be seeded 13-16... do you know if it might be possible to use this data to seed teams 13-16 correctly as well?

      Delete
    5. I've created a new file at now_seeding.json that has the projected tourney teams in order of score

      Delete
    6. amazing, thank you so much!!!!

      Delete
  13. Hi Bart,

    Thank you for all you do for the CBB community. Do you have a JSON/CSV file with information on quad 1/2/3/4 wins that includes who team x has beaten in each quadrant?

    ReplyDelete
    Replies
    1. The closest thing I have set up is a file at columns_now.json - it's a poorly organized json file but elements 8 - 11 are dictionary/objects that show who each team has played in each quadrant (8 is Q1, 9 is Q2, etc) but it is not broken down by wins & losses.

      Delete
    2. Okay, that's a start. Thanks. Is there JSON for each team's schedule with results? Maybe I could map the quadrant names from columns_now.json to values in the results file.

      Delete
  14. Hey Bart! big fan of the website and thanks so much for making all of that data available to us! I'm trying to use your super_sked dataset for a class I'm in, and I was just wondering though if you'd possibly be able to share what the column headers are for that dataset? Some are pretty self-explanatory but others I'm not quite sure, thanks again!

    ReplyDelete
    Replies
    1. Sorry I don't actually have this have this easily accessible in a way that would make much more sense so I prefer to leave it as a little puzzle ;)

      Delete
  15. Hi Bart - This is so cool. Is the data from the Teamsheets Rank page available in a .csv?

    ReplyDelete
  16. Hi Bart! Is there a CSV file or Json for a team's schedule and a result of the matchup? We found this page,https://barttorvik.com/results.php?team=Memphis&begin=20081101&end=20090501&conlimit=All&year=2009&top=0&hteam=&quad=5&rpi=&f=1, and we're hoping to find a source of this data without having to scrap it. The statistics you post are really awesome!

    ReplyDelete
    Replies
    1. getgamestats.php?year=2008&tvalue=Memphis will get you most/all of those stats in json.

      Delete
  17. Hi, Bart. Fantastic website! I am doing a school project on NCAA Tournament teams and would love to download your data for just NCAA Tournament teams each year from 2008-2019. Is there a CSV file for that? For instance, I would like to download all data from a page like this for each tournament: https://barttorvik.com/trank.php?year=2008&sort=&top=0&conlimit=All&venue=All&type=T&lastx=0#

    Thanks so much.

    ReplyDelete
    Replies
    1. if you put "&json=1" or "&csv=1" into the URL, you should get the data.

      Delete
  18. Hi Bart, is there anyway to view team strength of schedule ranks over a multi year span? (specifically looking for the 3 seasons from 2018-2021)

    ReplyDelete
    Replies
    1. Here is one way: https://barttorvik.com/program-maps.php?tvalue=Wisconsin&year=2021&sort=&t2value=None&avg=all&top=0&quad=4&venue=All&type=All&xax=99&yax=38

      Delete
  19. Noticed some missing data from Wichita's last game: https://www.barttorvik.com/box.php?muid=CincinnatiWichita+St.3-13&year=2021
    Not sure how this affects anything else related to your ratings.

    ReplyDelete
    Replies
    1. Weird, thanks for letting me know. SHouldn't affect the ratings, but does affect player stats.

      Delete
  20. Hi Bart,

    Great site. Love all the work you do. I'm curious if versioned Team data is available for download? That is, do you have and would you make available the team data from each day of the past few seasons (e.g. 2/17/2019, etc.)?

    ReplyDelete
    Replies
    1. data files are available at /timemachine/team_results/YYYYMMDD_team_results.json.gz(compressed json files)

      Delete
  21. Bart,

    Thanks for an amazing resource.

    Any chance you could leave players on the transfer page after they have committed to a new school? It would be interesting to be able to compare incomings based on Porpagatu! (or whatever else you want).

    ReplyDelete
    Replies
    1. Stats for committed transfers are here: https://barttorvik.com/playerstat.php?link=y&year=trans&minmin=0&start=-11101&end=trans0501

      Delete
  22. Hi Bart,

    Big fan of the site.

    I have been getting the advanced game stats for each game using getgamestats.php?year=2021 and I was wondering if there is anyway to get the raw totals for each game (like total turnovers, total rebounds, etc.) in a similar format as well.

    Thanks in advance

    ReplyDelete
    Replies
    1. those are available in the year_super_sked.json file or the year_season.json file.

      Delete
  23. Hey Bart, thanks for a great resource and being responsive.

    I was wondering if there is a strength of schedule data point? I know you adjust several things based on schedule strength, but I was looking for SOS as a specific number and maybe I'm dumb, but I'm unable to find it.

    If it is available, I am looking for it for multiple years as well.

    Appreciate any help you can provide.

    ReplyDelete
    Replies
    1. Hello,

      There are SOS metrics on the team page, and a summary table here:

      https://barttorvik.com/sos.php?year=2021

      Delete
    2. OK I could be going brain dead again, but I was able to load the CSV of this for 2021, but 2020 is not working. Or I have just forgotten how to do it.

      Appreciate any help. I was trying to put player PORPAGATU! by year with SOS by year dating back to 2009 (but probably didn't really need to go back that far, that's just what I saw on a previous question so for some reason I picked it.

      Delete
    3. Feel free to not post this msg. Just to clarify the previous.

      Actually no, it wasn't the schedule data I got, it was a copy of something I had already loaded.

      I am attempting to use the year=xxxx&csv=1 method.

      Delete
    4. Hi, not sure I'm following completely but there is no CSV available for that SOS page - cant just pull down the table though.

      Delete
  24. Hey Bart,

    Thanks for all of the amazing data you make available.

    I would like to use the player advanced stats gamelog data to pull stats like ORTG for a player by game. Files in json or csv would be great.

    Thanks,

    Wilson

    ReplyDelete
    Replies
    1. Hi Wilson,

      This data is available at YEAR_all_advgames.json

      Delete
    2. Hey Bart, love your stuff. This is fantastic!

      Is this page also updated during the season, the game by game stats?

      Delete
  25. Hi Bart,

    I am trying to navigate to the 2021 team shooting split TOTALs on your site, but it keeps taking me to the 2021 T-Rank page when I try to navigate there. It seems 2021 is the only year with this issue. Any other way I can see this data?

    Thanks!
    Jason

    ReplyDelete
    Replies
    1. Sorry about that, should be fixed now (just a bug related to going live with some 2022 stuff and my bad programming skills)

      Delete
  26. Hi Bart,

    Recently I went to go pull data in the JSON format through the JSONIO package in R, but I received an error that the connection could not be opened. This was odd since I've used this method many times in the past. Through some more digging I found that this may be a problem with the SSL certificate. On September 30th Let's Encrypt (who issued the certificate for your website) had their root certificate expire which meant that some connections will no longer work. They claim that there is some form of fix for this, but it's beyond me. Do you have any knowledge of this issue/potential workaround?

    Much appreciated, thanks.

    ReplyDelete
    Replies
    1. Hello. I'm sorry I don't know anything about the issue with Let's Encrypt's certificate not working. I frankly barely understand the SSL stuff at all, so it's somewhat miraculous that I got it set up at all, and if LE stops working that would be very bad.

      I do call some of my own data files through python, and that still seems to work.

      One thing you might try is just changing the URL you use to pull the data so that it starts "http://" instead of "https://"

      Delete
    2. Update on this. I discovered that my Tourneycast simulations actually broke after 9/30 for this reason because I run that script on my local PC, and the version of Python/requests on that machine was having this issue with the expired SSL root certificate. I upgraded the requests module for python, and that fixed it. So one thing you may wish to try is seeing if there is an update to the JSONIO package you're using for R, or else trying a difference package to pull down the data. If you figure it out, please do let me know.

      Delete
    3. After a very arduous process I have come up with a solution that so far has proven to work in R using the httr library.

      library(httr)
      set_config(config(ssl_verifypeer = FALSE))
      options(RCurlOptions = list(ssl_verifypeer = FALSE))
      options(rsconnect.check.certificate = FALSE)
      WebScrape <- GET("https://barttorvik.com/2021_team_results.json")
      Data <- as.data.frame(do.call(rbind, lapply(content(WebScrape,"parsed"), as.vector)))

      Obviously this then needs to be cleaned up, but it will return the proper JSON dataset.

      Delete
  27. Is there any way to see previous year's preseason projections? (such as the preseason ratings for 2019)

    ReplyDelete
    Replies
    1. Yes prior seasons are at trankpure19.php (change the year for other years)

      Delete
  28. Thank you so much for your data! I am interested in working with the player data from 2009 to present, and all together these excel sheets amount to over 60 thousand players. I was wondering if I can access a .csv file specifically for high-ranking prospects (or in other words, some sort of filter to look at only players that were drafted) or if I would have to do that manually. Thanks!

    ReplyDelete
    Replies
    1. You can pull the table from this: https://barttorvik.com/playerstat.php?link=y&sIndex=45&minGP=15&sortToggle=1&minpick=60&year=all&start=-11101&end=all0501&pickSelect=-1&erk=1500

      Delete
  29. Hello Bart,

    I was curious if you had an updated schedule spreadsheet for this coming CBB (Men's) season and if you do where one can find it?

    Thank you in advance, Sam!

    ReplyDelete
  30. Hello Bart,

    Huge Fan and I love the site that you have created! I wanted to reach out to see if you had the 2022 schedule in a .csv format that one could download from the website? Thank you!

    ReplyDelete
  31. Hey Bart, absolutely love this site! 2 questions for you.

    First, I can pull individual games stats from YEAR_all_advgames.json for every year 2008-2021, but nothing is popping up for 2022. Is this just a delay thing?

    Second, is there a way to pull a csv for allrostersYY.php? I've tried allrosters19.php?&csv=1 for example, but no dice.

    Thanks a ton, and excited to follow for another season!

    ReplyDelete
  32. Hi Bart,

    Have you ever done some backtesting of your model accuracy vs. Vegas opening and closing lines? I hope you don't mind, but I've been trying to run this analysis on 2 year's worth of historical data and have found some interesting results. It does appear that there's a positive correlation between the magnitude of the discrepancy between your line and Vegas, and the likelihood that your line is closer to the final result. This appears to be especially strong in November/December. As the season progresses however, the frequency and magnitude of prediction discrepancy progressively reduces, and accuracy performance vs. Vegas becomes more random, especially when the magnitude of prediction discrepancy is low. While the frequency of large prediction discrepancies drops dramatically late in the season, the predictive power seems to become stronger.

    This has all been in an endeavor to find a sweet spot of when/where betting based on your predictions is profitable. Curious to see if you've found similar trends?

    ReplyDelete
    Replies
    1. Hello - that is interesting stuff. I have not tracked betting lines prior to this year, just haven't had the data. I also always figured it would be kind of brutal because (1) sports books have access to all kinds of additional useful stuff including actual betting data (wisdom of the crowd) and can respond to injuries in ways my model does not. (Not to mention they have access to my site!)

      But I have been tracking them this year and have been pleasantly surprised. In games where the discrepancy is more than 3.0 points, at this moment taking the T-Rank suggestion would have you 96-69-5 which seems prettay good. It seems the preseaon projections were unusually good this year, which is gratifying (because they are the result of a shit-ton of effort.)

      That said, I don't recommend anyone use my site for gambling purposes and certainly make no warranties.

      Delete
    2. Yes, 58% success rate against the spread is phenomenal, and if it is repeatable over a large sample is something that professional gamblers salivate over. The 3 to 4 point discrepancy seems to be a key inflection point based on my analysis as well. When you say that you are tracking betting lines starting this year, are you doing that manually and can we find it on your site anywhere or is it available to download like the other stats files you've pointed to in these threads?

      Much respect to the hard work you have put into this project!

      Delete
  33. Hey Mr. Bart, I am familiar with pulling simple tables off of websites into excel and google sheets but I am having trouble with the Team Table which has some amazing information. Thanks for all your hard work and would love to know if you could help me get this table, specifically for stats like the height and experience.

    ReplyDelete
    Replies
    1. Hello, it should work to add "csv=1" to the URL parameters. E.g., team-tables_each.php?csv=1 will allow you to save the data in the table.

      Delete
  34. What are the hurdles that you see to adding a feature to account for injuries in future seasons? If it's something you're interested in, I would love to try and help accomplish it.

    ReplyDelete
    Replies
    1. First, there would have to be pretty robust model for what effect to give to an injury. I don't have that, and it's not something I really have any good ideas on. (I only have my "highly dubious missing player analysis.").

      Second, I would need highly detailed and accessible information about injuries for all 358 teams. (Because for this to be useful it would ideally be prospective, and for retrospective usefulness a DNP-coach's decision is qualitatively different that a DNP-injury.)

      Both of those are pretty much dealbreakers for me.

      One thing I've considered, which I may look into this offseason, is figuring out a way to incorporate betting lines into the ratings, since really provide the best information about injuries that I can think of.

      Delete
    2. Just my 2 cents, but I think you will find that incorporating betting lines is not a good adjustment for injuries. It may be a powerful indicator late in the season, but early in the season when there's very few injuries it will have a detrimental impact. Your ratings do so well early in the season it would be a shame to tinker with that.

      Delete
    3. Thanks for the input. Realistically I probably never will incorporate betting lines, just something I've thought about.

      Delete
  35. Hi Bart,

    I was looking for shooting split data separated into close 2s, long 2s, and threes like you have on your site here https://barttorvik.com/teampbp.php?year=2021&conlimit=&sort=1 for a school project for the years 2017-2021, but I was having trouble downloading it as a cvs file. I would really appreciate it if you could provide directions for how to download the shooting split data. Thank you!

    ReplyDelete
    Replies
    1. Hello, there are files at YEAR_pbp_teamsstats.json - sorry don't have those easily accessible in CSV format but hopefully you can work with the json. Also, if you're just looking for those five years it's pretty easy to just pull the tables from the page into Excel, whether through a simply copy & paste or through an extension like Table Capture (https://chrome.google.com/webstore/detail/table-capture/iebpjdmgckacbodjpijphcplhebcmeop?hl=en)

      Delete
    2. Thank you for the advice. I was able to get the data into an excel file using the table capture tool. Appreciate it!

      Delete
  36. Hi Bart,

    First off, this website is sensational. Thanks for all that you do with it and for your responsiveness!

    Second, I am working on a school project where we are planning to run a logistical regression model to predict what teams will make the NCAA tournament based on projection data. I'm not trying to go too crazy with this, but there are two important features that we will definitely need -- returning player data and incoming transfer data.

    Starting with returning player data, I found returning possession % and returning minutes % at this link: https://barttorvik.com/trankpure17.php?. However, this only goes back through 2017. I know data for returning minutes % goes back through 2009 because I can find it at this link: https://barttorvik.com/program-maps.php?tvalue=Virginia&year=2022&sort=&t2value=None&avg=all&top=0&quad=4&venue=All&type=All&xax=37&yax=3 . Is there a link I can go to to get this data for each season without having to scrape it myself from these team charts? Ditto for returning poss. % if possible. Lastly on returning player data, I know for a fact I saw returning points % as well, but I cannot find that stat anywhere. To summarize what I am looking for in terms of returning player data, it is as follows:
    1. Returning minutes %
    2. Returning possession %
    3. Returning points %
    for all teams from 2010-present. Is this possible? Even one of these stats would be extraordinarily helpful, but several or all would be better.

    As for transfer player data, this seems a bit more tricky. In one of your player stat pages (https://barttorvik.com/playerstat.php?link=y&xvalue=trans&year=2021) you linked a nice website that lists all known transfers (https://verbalcommits.com/transfers/2012). It wouldn't be a perfect solution since it is often unclear whether the player was immediately eligible or not, and it would require a decent amount of preprocessing, but it is a possibility. But more ideally, I was hoping you had some sort "one-number" stat that rates the incoming transfers for a particular team in a particular season. I would imagine you had something like that for projection purposes, but I am not sure where to find it.

    Thanks in advance for your help!

    ReplyDelete
  37. Can you tell me where to find each team's adjusted tempo? Most of the team data are in https://barttorvik.com/YEAR_fffinal.csv, but I didn't see tempo stats there. Thanks!

    ReplyDelete
  38. Hey,

    I really appreciate all the information you have provided. I am trying to download 2019-2022(current year) metrics and having trouble downloading it as a csv for a school project. All I am looking for is the data that's provided on the home page.
    Thanks!

    ReplyDelete
    Replies
    1. Hi Cameron. You could try adding &csv=1 to the URL on the main page, should get you most of the info in a CSV although you'll have to do some manual manipulation of the resulting file and you'll have to figure out which columns are which. All the stats are also available directly in files at YEAR_fffinal.csv and YEAR_team_results.csv

      Delete
  39. Hello Bart, where could I find a list of the oldest to youngest teams in D1 BB, for the 2021-22 season? Thank you, sir!

    ReplyDelete
    Replies
    1. I'm not sure - I do not have/publish age data on the site. I do keep track of an "experience" stat that's based on class year, and you can look at that on the Team Charts page or the Team Tables page.

      Delete
  40. Hello Bart, do you have any downloadable data on home/road splits? Thank you sir for the work. Love the numbers.

    ReplyDelete
    Replies
    1. Nothing prefabricated but if you filter to home/road splits on the main page and then add &csv=1 or &json=1 to the URL parameters, you get the data.

      Delete
  41. Thanks again. I tried adding &csv=1 to the end of the URL, but it didn't download. Is the home split URL "https://barttorvik.com/trank.php?year=2022&sort=&hteam=&t2value=&conlimit=All&state=All&begin=20211101&end=20220501&top=0&revquad=0&quad=5&venue=H&type=All&mingames=0#" ?

    ReplyDelete
  42. Mr. Torvik, Got it. Thanks again!

    ReplyDelete
  43. Mr. Torvik, one last thing. Is there a way to find home/road FT%? Only the FTRate is included in the home page splits. Thank you.

    ReplyDelete
    Replies
    1. You can look at it for individual teams by using the filters on the team pages and for all teams at once on the team_tables page: https://barttorvik.com/team-tables_each.php

      Delete
  44. Hi Bart,

    Is it possible to download Teamsheet data over a given time range. I'm interested in looking at partial season F.U.N. data, but I can only find daily snapshots. Would it be possible to find a team's F.U.N. from, say, 1/15/2020 - 5/1/2020?

    ReplyDelete
    Replies
    1. Unfortunately that's not something I've got set up to work.

      Delete
  45. Hello! Thank you for yo TIAur hard work and dedication. I was wondering if you had this same type of website but, for NBA?

    ReplyDelete
  46. Is a CSV or JSON of the transfer stats available anywhere, perhaps? I tried adding “&csv=1,” etc., to the url and returned all player stats, not transfers only. Thanks in advance!

    ReplyDelete
    Replies
    1. The transfer stats page is created dynamically by just excluding guys not on the transfer list. Though obviously you can just copy the table.

      Delete
  47. Is there any way to extract all of the season stats available, not just those on the team stats page. I would like to see 3pt attempts, total assists, etc?

    ReplyDelete
    Replies
    1. Not really from my site in a ready format - obviously those kinds of counting stats are generally available and not really what what my site is focused on.

      Delete
  48. Hey Bart,
    Appreciate all you have done with the website. I am trying to get all regular season data from 2008 to now from the team tables. As I'm trying to predict ncaa tournament success and don't want tournament data included. Where would I place the csv=1 in the url to grab this data.

    ReplyDelete
    Replies
    1. You would change the "type" filter to "Regular Season" and then add the "&csv=1" to the resulting URL

      Delete
  49. Hello, I am using your player stats for a project, thank you very much for the quality data you provided. I got a question about the feature "Min_per". What exactly is this feature, I think it is the percentage of the total minutes of the games which the player get a chance to play. Please correct me if I am wrong.

    ReplyDelete
    Replies
    1. Hello, yes it's the player's percentage of available team minutes played. Available minutes are total team minutes divided by 5.

      Delete
  50. Hello Bart, I have been working with your player data for a machine learning project, while I was analyzing the features, I encountered a feature named rec rank. what is that feature stands for? could you please give info about that? Thank you very much for the data you provided.

    ReplyDelete
    Replies
    1. Hello that is short for "recruiting rank" i.e. what the player was ranked as a recruit coming out of high school.

      Delete
  51. Hi Bart, in your player data, There are features named as "rimmade" and "midmade" what are they mean? Thanks a lot.

    ReplyDelete
    Replies
    1. rimmade = shots made at or near the rim; midmade = two point shots that were not made at or near the rim (i.e., midrange)

      Delete
  52. Hello Bart, I am using your player data for a machine learning project. I am examining the features at the moment. I have a question about the feature called gbpm, I know what bpm is, but could not figure out what gbpm is. Can you inform me about that? Thanks a lot.

    ReplyDelete
    Replies
    1. Hello. I am not very careful with these internal labels since they only need to make sense to me. And there are some vestigial/superseded stats that for logistical reasons still get tracked & produced. In the data you are looking at, I believe "bpm" is the *original* version of Daniel Myers's BPM, and "gbpm" is the revised & updated version (BPM 2.0). I originally labeled it GBPM because when Daniel was developing it one of his main goals was to remove non-linear variables, which would make it suitable for use on small samples, including a single game (thus: "game box plus minus," GBPM).

      Delete
    2. Thanks a lot, you are the man.

      Delete