XML : Extracting html table with rowspan values in R

The dataframe I create with the following code puts the three letter team abbreviation into only the first row that it spans. Is there another package or additional code I can add to keep the data in the proper column? Thanks for your help.

  url <- "https://en.wikipedia.org/wiki/List_of_Major_League_Baseball_postseason_teams"  url_source <- readLines(url, encoding = "UTF-8")  playoffs <- data.frame(readHTMLTable(url_source, stringsAsFactors = F, header = T) [2])    

No comments:

Post a Comment