Package rtext. January 23, 2019

Similar documents
Package crossword.r. January 19, 2018

Package stringb. November 1, 2016

Package ECctmc. May 1, 2018

Package triebeard. August 29, 2016

Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Package validara. October 19, 2017

Package robotstxt. November 12, 2017

Package pdfsearch. July 10, 2018

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package nngeo. September 29, 2018

Package strat. November 23, 2016

Package messaging. May 27, 2018

Package semver. January 6, 2017

Package fastdummies. January 8, 2018

Package dbx. July 5, 2018

Package readobj. November 2, 2017

Package kdtools. April 26, 2018

Package bigreadr. R topics documented: August 13, Version Date Title Read Large Text Files

Package canvasxpress

Package weco. May 4, 2018

Package profvis. R topics documented:

Package spark. July 21, 2017

Package loggit. April 9, 2018

Package oec. R topics documented: May 11, Type Package

Package geogrid. August 19, 2018

R topics documented: 2 clone

Package fastrtext. December 10, 2017

Package lumberjack. R topics documented: July 20, 2018

Package gtrendsr. August 4, 2018

Package IRkernel. January 7, 2019

Package eply. April 6, 2018

Package githubinstall

Package gtrendsr. October 19, 2017

Package ggimage. R topics documented: December 5, Title Use Image in 'ggplot2' Version 0.1.0

Package AhoCorasickTrie

Package readxl. April 18, 2017

Package docxtools. July 6, 2018

Package datasets.load

Package wrswor. R topics documented: February 2, Type Package

Package goodpractice

Package gggenes. R topics documented: November 7, Title Draw Gene Arrow Maps in 'ggplot2' Version 0.3.2

Package tidyselect. October 11, 2018

Package ggimage. R topics documented: November 1, Title Use Image in 'ggplot2' Version 0.0.7

Package slickr. March 6, 2018

Package knitrprogressbar

Package taxizedb. June 21, 2017

Package hyphenatr. August 29, 2016

Package calpassapi. August 25, 2018

Package projector. February 27, 2018

Package deductive. June 2, 2017

Package virustotal. May 1, 2017

Package milr. June 8, 2017

Package fastqcr. April 11, 2017

Package combiter. December 4, 2017

Package fbroc. August 29, 2016

Package bikedata. April 27, 2018

Package qualmap. R topics documented: September 12, Type Package

Package QCAtools. January 3, 2017

Package modules. July 22, 2017

Package datapasta. January 24, 2018

Package textrecipes. December 18, 2018

Package ompr. November 18, 2017

Package fst. December 18, 2017

Package vip. June 15, 2018

Package splithalf. March 17, 2018

Package rgho. R topics documented: January 18, 2017

Package narray. January 28, 2018

Package reconstructr

Package iccbeta. January 28, 2019

Package understandbpmn

Package embed. November 19, 2018

Package coga. May 8, 2018

Package jdx. R topics documented: January 9, Type Package Title 'Java' Data Exchange for 'R' and 'rjava'

Package humanize. R topics documented: April 4, Version Title Create Values for Human Consumption

Package fitbitscraper

Package rzeit2. January 7, 2019

Package pwrab. R topics documented: June 6, Type Package Title Power Analysis for AB Testing Version 0.1.0

Package jpmesh. December 4, 2017

Package tibble. August 22, 2017

Package shinyfeedback

Package rprojroot. January 3, Title Finding Files in Project Subdirectories Version 1.3-2

Package crochet. January 8, 2018

Package d3plus. September 25, 2017

Package SEMrushR. November 3, 2018

Package ggseas. June 12, 2018

Package filematrix. R topics documented: February 27, Type Package

Package essurvey. August 23, 2018

Package fasttextr. May 12, 2017

Package queuecomputer

Package sankey. R topics documented: October 22, 2017

Package wdman. January 23, 2017

Package mdftracks. February 6, 2017

Package censusr. R topics documented: June 14, Type Package Title Collect Data from the Census API Version 0.0.

Package haven. April 9, 2015

Package lexrankr. December 13, 2017

Package GetITRData. October 22, 2017

Package regexselect. R topics documented: September 22, Version Date Title Regular Expressions in 'shiny' Select Lists

Package dkanr. July 12, 2018

Package ecoseries. R topics documented: September 27, 2017

Package labelvector. July 28, 2018

Transcription:

Title R6 Objects for Text and Data Date 2019-01-21 Version 0.1.21 Package January 23, 2019 For natural language processing and analysis of qualitative text coding structures which provide a way to bind together text and text data are fundamental. The package provides such a structure and accompanying methods in form of R6 objects. The '' class allows for text handling and text coding (character or regex based) including data updates on text transformations as well as aggregation on various levels. Furthermore, the usage of R6 enables inheritance and passing by reference which should enable '' instances to be used as back-end for R based graphical text editors or text coding GUIs. Depends R (>= 3.0.0), stringb (>= 0.1.13) License MIT + file LICENSE LazyData TRUE Imports R6 (>= 2.1.2), hellno (>= 0.0.1), magrittr (>= 1.5), Rcpp (>= 0.12.5), digest (>= 0.6.9), RSQLite (>= 1.0.0), stats, graphics Suggests testthat, knitr, rmarkdown BugReports https://github.com/petermeissner//issues URL https://github.com/petermeissner/ RoxygenNote 6.1.1 LinkingTo Rcpp NeedsCompilation yes Author Peter Meissner [aut, cre], Ulrich Sieberer [cph], University of Konstanz [cph] Maintainer Peter Meissner <retep.meissner@gmail.com> Repository CRAN Date/Publication 2019-01-23 22:00:03 UTC 1

2 plot. R topics documented: modus............................................ 2 plot............................................ 2 prometheus_early...................................... 3 prometheus_late....................................... 3 R6 extended..................................... 4............................................. 4 _base.......................................... 6 _export......................................... 7 _loadsave........................................ 7 _tokenize........................................ 8 text_tokenize....................................... 8 Index 10 modus function giving back the mode function giving back the mode modus(x, multimodal = FALSE, warn = TRUE) Arguments x multimodal warn vector to get mode for wether or not all modes should be returned in case of more than one should the function warn about multimodal outcomes? plot. function for plotting function for plotting ## S3 method for class '' plot(x, y = NULL, lines = TRUE, col = "#ED4C4CA0", add = FALSE,...)

prometheus_early 3 Arguments x object of class y char_data to be plotted lines vector of integer listing the lines to be plottted col color of the char_data variable to be highlighted add add data to an already existing plot?... further parameters passed through to initial plot prometheus_early prometheus early version prometheus early version prometheus_early An object of class character of length 1. Source https://de.wikisource.org/w/index.php?title=prometheus_(gedicht,_fr prometheus_late prometheus late version prometheus late version prometheus_late An object of class character of length 1. Source https://de.wikisource.org/w/index.php?title=prometheus_(gedicht,_sp

4 R6 extended extended R6 class extended R6 class R6 extended R6Class object. See Also An R6Class generator object.

5 The class family Rtext consists of an set of R6 classes that are conencted by inheritance. Each class handles a different set of functionalities that are - despite needing the data structure provided by _base - independent. R6 extended A class that has nothing to do per se with but merely adds some basic features to the base R6 class (debugging, hashing, getting fields and handling warnings and messages as well as listing content) _base [inherits from R6 extended] The foundation of the class. This class allows to load and store text, its meta data, as well as data about the text in a character by character level. _loadsave [inherits from _base] Adds load and save methods for loading and saving objects (text and data) into/from Rdata files. _export [inherits from _loadsave] Adds methods to import and export from and to SQLite databases - like load and save but for SQLite. _tokenize [inherits from _export] Adds methods to aggregate character level data onto token level. (the text itself can be tokenized via S3 methods from the stringb package - e.g. text_tokenize_words()) [inherits from _tokenize] Adds no new features at all but is just a handy label sitting on top of all the functionality provided by the inheritance chain. Examples # initialize (with text or file) quote_text <- "Outside of a dog, a book is man's best friend. Inside of a dog it's too dark to read." quote <- $new(text = quote_text) # add some data quote$char_data_set("first", 1, TRUE) quote$char_data_set("last", quote$char_length(), TRUE) # get the data quote$char_data_get() # transform text quote$char_add("[this is an insertion] \n", 47) # get the data again (see, the data moved along with the text) quote$text_get() quote$char_data_get() # do some convenience coding (via regular expressions) quote$char_data_set_regex("dog_friend", "dog", "dog") quote$char_data_set_regex("dog_friend", "friend", "friend") quote$char_data_get() # aggregate data by regex pattern

6 _base quote$tokenize_data_regex(split="(dog) (friend)", non_token = TRUE, join = "full") # aggregate data by words quote$tokenize_data_words(non_token = TRUE, join="full") # aggregate data by lines quote$tokenize_data_lines() # plotting and data highlighting plot(quote, "dog_friend") # adding further data to the plot plot(quote, "dog_friend") plot(quote, "first", col="steelblue", add=true) plot(quote, "last", col="steelblue", add=true) _base _base : basic workhorse for _base : basic workhorse for _base R6Class object. See Also

_export 7 _export _export R6Class object. See Also _loadsave R6 class - load and save methods for R6 class - load and save methods for _loadsave R6Class object. See Also

8 text_tokenize. _tokenize _tokenize R6Class object. See Also text_tokenize. function tokenizing objects function tokenizing objects ## S3 method for class '' text_tokenize(string, regex = NULL, ignore.case = FALSE, fixed = FALSE, perl = FALSE, usebytes = FALSE, non_token = FALSE) Arguments string regex ignore.case fixed perl text to be tokenized regex expressing where to cut see (see gregexpr) whether or not reges should be case sensitive (see gregexpr) whether or not regex should be interpreted as is or as regular expression (see gregexpr) whether or not Perl compatible regex should be used (see gregexpr)

text_tokenize. 9 usebytes non_token byte-by-byte matching of regex or character-by-character (see gregexpr) should information for non-token, i.e. those patterns by which the text was splitted, be returned as well

Index Topic datasets prometheus_early, 3 prometheus_late, 3 Topic data R6 extended, 4, 4 _base, 6 _export, 7 _loadsave, 7 _tokenize, 8 gregexpr, 8, 9 modus, 2 plot., 2 prometheus_early, 3 prometheus_late, 3 R6 extended, 4 R6Class, 4, 6 8, 4, 4, 6 8 _base, 6 _export, 7 _loadsave, 7 _tokenize, 8 text_tokenize., 8 10