Package internetarchive

Similar documents
Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package calpassapi. August 25, 2018

Package SEMrushR. November 3, 2018

Package robotstxt. November 12, 2017

Package ECctmc. May 1, 2018

Package goodpractice

Package githubinstall

Package rwars. January 14, 2017

Package knitrprogressbar

Package pdfsearch. July 10, 2018

Package messaging. May 27, 2018

Package fastdummies. January 8, 2018

Package facerec. May 14, 2018

Package fitbitscraper

Package ezknitr. September 16, 2016

Package validara. October 19, 2017

Package available. November 17, 2017

Package dbx. July 5, 2018

Package statsdk. September 30, 2017

Package censusapi. August 19, 2018

Package rzeit2. January 7, 2019

Package guardianapi. February 3, 2019

Package opencage. January 16, 2018

Package virustotal. May 1, 2017

Package dkanr. July 12, 2018

Package rgho. R topics documented: January 18, 2017

Package censusr. R topics documented: June 14, Type Package Title Collect Data from the Census API Version 0.0.

Package datasets.load

Package data.world. April 5, 2018

Package fingertipsr. May 25, Type Package Version Title Fingertips Data for Public Health

Package docxtools. July 6, 2018

Package gtrendsr. October 19, 2017

Package spelling. December 18, 2017

Package essurvey. August 23, 2018

Package postgistools

Package patentsview. July 12, 2017

Package WordR. September 7, 2017

Package rprojroot. January 3, Title Finding Files in Project Subdirectories Version 1.3-2

Package jpmesh. December 4, 2017

Package rnaturalearth

Package msgtools. March 2, 2017

Package triebeard. August 29, 2016

Package qualmap. R topics documented: September 12, Type Package

Package semver. January 6, 2017

Package tabulizer. June 7, 2018

Package bisect. April 16, 2018

Package canvasxpress

Package lumberjack. R topics documented: July 20, 2018

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package bigqueryr. October 23, 2017

Package repec. August 31, 2018

Package algorithmia. September 13, 2016

Package condusco. November 8, 2017

Package oec. R topics documented: May 11, Type Package

Package rbraries. April 18, 2018

Package wikitaxa. December 21, 2017

Package loggit. April 9, 2018

Package SNPediaR. April 9, 2019

Package gtrendsr. August 4, 2018

Package edfreader. R topics documented: May 21, 2017

Package smapr. October 20, 2017

Package barcoder. October 26, 2018

Package geniusr. December 6, 2017

Package cancensus. February 4, 2018

Package benchmarkme. R topics documented: February 26, Type Package Title Crowd Sourced System Benchmarks Version 0.2.

Package modules. July 22, 2017

Package rcmdcheck. R topics documented: November 10, 2018

Package ezsummary. August 29, 2016

Package kdtools. April 26, 2018

Package httpcache. October 17, 2017

Package widyr. August 14, 2017

Package texpreview. August 15, 2018

Package sfc. August 29, 2016

Package TrafficBDE. March 1, 2018

Package aws.transcribe

Package climber. R topics documented:

Package ggimage. R topics documented: December 5, Title Use Image in 'ggplot2' Version 0.1.0

Package cregulome. September 13, 2018

Package reval. May 26, 2015

Package GetITRData. October 22, 2017

Package wdman. January 23, 2017

Package carbonate. R topics documented: October 14, 2018

Package ssh. June 4, 2018

R topics documented: 2 checkpoint-package

Package vinereg. August 10, 2018

Package profvis. R topics documented:

Package nngeo. September 29, 2018

Package BiocManager. November 13, 2018

Package fastqcr. April 11, 2017

Package ggimage. R topics documented: November 1, Title Use Image in 'ggplot2' Version 0.0.7

Package bigqueryr. June 8, 2018

Package resumer. R topics documented: August 29, 2016

Package mdftracks. February 6, 2017

Package tfruns. February 18, 2018

Package RODBCext. July 31, 2017

Package wrswor. R topics documented: February 2, Type Package

Package stapler. November 27, 2017

Package spark. July 21, 2017

Package deductive. June 2, 2017

Package liftr. R topics documented: May 14, Type Package

Transcription:

Type Package Title An API Client for the Internet Archive Package internetarchive December 8, 2016 Search the Internet Archive, retrieve metadata, and download files. Version 0.1.6 Date 2016-12-08 License MIT + file LICENSE LazyData TRUE URL https://github.com/ropensci/internetarchive BugReports https://github.com/ropensci/internetarchive/issues Depends R (>= 3.1.0) Imports httr (>= 1.1.0), dplyr (>= 0.4.3) Suggests knitr, testthat, rmarkdown VignetteBuilder knitr RoxygenNote 5.0.1 NeedsCompilation no Author Lincoln Mullen [aut, cre] Maintainer Lincoln Mullen <lincoln@lincolnmullen.com> Repository CRAN Date/Publication 2016-12-08 20:38:21 R topics documented: ia_browse.......................................... 2 ia_download......................................... 2 ia_files............................................ 3 ia_get_items......................................... 4 ia_item_id.......................................... 5 ia_keyword_search..................................... 5 1

2 ia_download ia_list_fields......................................... 6 ia_metadata......................................... 6 ia_search.......................................... 7 internetarchive....................................... 8 Index 9 ia_browse Open an Internet Archive item in the browser Open an Internet Archive item in the browser ia_browse(item_id, type = c("details", "stream")) item_id type The item identifier. If multiple item identifiers are passed in, only the first will be opened. Which page to open: details is the metadata page, stream is the viewing page for items which are associated with a PDF. Returns the item ID(s) passed to the function. # Distinguished Converts to Rome in America ia_browse("distinguishedcon00scanuoft") ia_download Download files for Internet Archive items. Download files for Internet Archive items. ia_download(files, dir = ".", extended_name = TRUE, overwrite = FALSE, silence = FALSE)

ia_files 3 files dir extended_name overwrite silence A data frame of files returned by ia_files. You should filter this data frame to download only the files that you actually want. The directory in which to save the downloaded files. If this argument is FALSE, then the downloaded file will have a filename in the following format: itemidentifier.extension, e.g., thedamnationofth00133gut.txt. If there are multiple files of the same file type for an item, then the file names will not be unique. If this argument is TRUE, them the downloaded file will have a filename in the following format: itemidentifier-original-filename.extension, e.g., thedamnationofth00133gut-133.txt. If TRUE, this function will download all files and overwrite them on disk if they have already been downloaded. If FALSE, then if a file already exists on disk it will not be downloaded again but other downloads will proceed normally. If false, print the item IDs as they are downloaded. A data frame including the file names of the downloaded files. ## Not run: if (require(dplyr)) { dir <- tempdir() ia_get_items("thedamnationofth00133gut") %>% ia_files() %>% filter(type == "txt") %>% # download only the files we want ia_download(dir = dir, extended_name = FALSE) } ## End(Not run) ia_files Access the list of files associated with an Internet Archive item Access the list of files associated with an Internet Archive item ia_files(items) items A list describing an Internet Archive items returned from the API.

4 ia_get_items A list containing the files as a list of character vectors. ## Not run: ats_query <- c("publisher" = "american tract society") ids <- ia_search(ats_query, num_results = 3) items <- ia_get_items(ids) files <- ia_files(items) files ## End(Not run) ia_get_items Get the metadata for Internet Archive items Get the metadata for Internet Archive items ia_get_items(item_id, silence = FALSE) item_id silence A character vector containing the ID for an Internet Archive item. This argument is vectorized, so you can retrieve multiple items at once. If false, print the item IDs as they are retrieved. A list containing the metadata returned by the API. List names correspond to the item IDs. ## Not run: ia_get_items("thedamnationofth00133gut") ats_query <- c("publisher" = "american tract society") ids <- ia_search(ats_query, num_results = 2) ia_get_items(ids) ## End(Not run)

ia_item_id 5 ia_item_id Access the item IDs from an Internet Archive items Access the item IDs from an Internet Archive items ia_item_id(item) item A list describing an Internet Archive items returned from the API. This argument is vectorized. A character vector containing the item IDs. ats_query <- c("publisher" = "american tract society") ids <- ia_search(ats_query, num_results = 3) items <- ia_get_items(ids) ia_item_id(items) ia_keyword_search Perform an simple keyword search of the Internet Archive. Perform an simple keyword search of the Internet Archive. ia_keyword_search(keywords, num_results = 5, page = 1, print_total = TRUE) keywords num_results page print_total The keywords to search for. The number of results to return per page. When results are paged, which page of results to return. Should the total number of results for this query be printed as a message?

6 ia_metadata A character vector of Internet Archive item IDs. ia_keyword_search("isaac hecker", num_results = 20) ia_list_fields List accepted metadata fields List accepted metadata fields ia_list_fields() A list of the accepted metadata fields ia_list_fields() ia_metadata Access the item metadata from an Internet Archive item Access the item metadata from an Internet Archive item ia_metadata(items) items A list object describing an Internet Archive items returned from the API. A data frame containing the metadata, with columns id for the item identifier, field for the name of the metadata field, and value for the metadata values.

ia_search 7 ats_query <- c("publisher" = "american tract society") ids <- ia_search(ats_query, num_results = 3) items <- ia_get_items(ids) metadata <- ia_metadata(items) metadata ia_search Search the Internet Archive Perform an advanced search of the Internet Archive, specifying which metadata fields to search. Note that all searches are in the form of "contains," i.e., the title contains the search term. ia_search(terms, num_results = 5, page = 1, print_url = FALSE, print_total = TRUE) terms num_results page print_url print_total A set of metadata fields and corresponding values to search. These should take the form of a named character vector. The number of results to return per page. When results are paged, which page of results to return. Should the URL used for the query be printed as a message? Should the total number of results for this query be printed as a message? A character vector of Internet Archive item IDs. References See the documentation on the Internet Archive s advanced search page. query1 <- c("title" = "damnation of theron ware") ia_search(query1) query2 <- c("title" = "damnation of theron ware", "contributor" = "gutenberg") ia_search(query2)

8 internetarchive internetarchive Client for the Internet Archive API This client permits you to search (ia_search), retrieve item metadata (ia_metadata) and associated files (ia_files), and download files (ia_files) in a pipeable interface.

Index ia_browse, 2 ia_download, 2 ia_files, 3, 3, 8 ia_get_items, 4 ia_item_id, 5 ia_keyword_search, 5 ia_list_fields, 6 ia_metadata, 6, 8 ia_search, 7, 8 internetarchive, 8 9