
RapidsDB
RapidsDB User Guide
Release 3.0 Beta
Borrui Data Technology Co. Ltd 2016

RapidsDB User Guide

Table of Contents

1. Overview
   What is RapidsDB?
   RapidsDB Components
      SQL Compiler and Optimizer
      MPP Execution Engine
      Federated Connectors
      RapidsSE
      Client API
      Zookeeper
   RapidsDB Cluster Topology
2. Federations, Connectors and Naming
   Overview
   Connectors
   Table Naming
   Name Resolution
   Importing Tables
   Mapping Catalog, Schema and Table Names
3. Query Interfaces
   RapidsDB shell
      Invoking the RapidsDB shell
      RapidsDB shell Syntax
      RapidsDB shell: EXPLAIN
         Simple SELECT
         Aggregate Query
         1-way JOIN
      RapidsDB shell: STATS
         Simple SELECT
         Aggregate Query
         1-way JOIN
   Programmatic API
      RapidsDB shell Scripting Mode
      Invoking the RapidsDB shell Programmatically from Java
4. SQL Syntax
   Lexical Structure
      Identifiers and Keywords
      Constants
         String Constants
         Numeric Constants
      Operators
      Special Characters
      Comments
      Operator Precedence
   Data Types and Type Specifiers
      Data Types
      Type Specifiers
         Use in CAST
         Use in Column Definitions
         System Metadata
         Internal Precision
   Value Expressions
      Column References
      Operator Invocation
      Function Call
      Aggregate Expression
      Type Cast
      Decimal Expressions and Precision
      Scalar Subquery
      Expression Evaluation Rules
5. Queries
   Overview
   Table Expressions
      The FROM Clause
         Joined Tables (CROSS JOIN, INNER JOIN, LEFT OUTER JOIN, RIGHT OUTER JOIN, ON Clause, USING Clause)
         Table and Column Aliases
         Subqueries
      WHERE Clause
      GROUP BY and HAVING Clause
   SELECT Lists
      SELECT List Items
      Column Labels
      DISTINCT
   Combining Queries
   ORDER BY
   LIMIT and OFFSET
Functions and Operators
   Logical Operators
   Comparison Operators and BETWEEN
   Mathematical Operators and Functions
   String Functions and Operators
   Pattern Matching LIKE
   Date/Time Functions
      EXTRACT (from timestamp)
      CURRENT_TIMESTAMP
      NOW()
      Interval Arithmetic
         Interval Types (YEAR-MONTH interval, DAY-TIME interval)
         Support for Interval Arithmetic
         EXTRACT (from interval)
         String to Interval Functions
         BETWEEN Operator
   Conditional Expressions (CASE, COALESCE, IF, IFNULL, NULLIF)
   Aggregate Functions
   Sub-query Expressions (IN, NOT IN, EXISTS)
Query Execution
   RapidsDB SQL Statement Execution
      Partitioned Query Plans
      Non-Partitioned Query Plans
      Combination of Partitioned and Non-Partitioned Plans
   RapidsDB Join Algorithms
      Bloom Join
      Hash Join
INSERT
DDL
   CREATE TABLE
   DROP TABLE
REFRESH Command
System Metadata Tables
   Overview
   NODES Table
   FEDERATIONS Table
   CONNECTORS Table
   CATALOGS Table
   SCHEMAS Table
   TABLES Table
   COLUMNS Table
   TABLE_PROVIDERS Table
Performance Tuning
   Using EXPLAIN and STATS
   JOIN Order
   JOIN Hints
   Restrict Amount of Data
Error Messages
   RapidsDB shell Messages
   Query Rejection Messages
   Data Store-Related Messages
Appendix A: Sample Scripter Program
Appendix B: SQL Grammar

1. Overview

1.1 What is RapidsDB?

RapidsDB is a fully parallel, distributed, in-memory federated query system designed to support complex analytical SQL queries running against a set of different data stores. The diagram below shows the major components of the RapidsDB system:

Figure 1. RapidsDB architecture

RapidsDB provides unified SQL access to a wide variety of data sources, which can include relational and non-relational data sources. Data can be joined across all of the data sources. For Release 3.0 Beta, RapidsDB supports access to its own internal in-memory data store, RapidsSE, to the MemSQL and VoltDB in-memory data stores, and to streaming data sources. The next release will add support for any data source that provides a JDBC interface, as well as access to data stored in Hadoop HDFS.

1.2 RapidsDB Components

1.2.1 SQL Compiler and Optimizer

RapidsDB has an advanced SQL Compiler & Optimizer that takes a user's SQL query and builds a query plan that takes full advantage of the native SQL capabilities of the underlying data source. The generated query plan pushes down those parts of the query that can be executed directly by the data source, and executes the remainder of the plan in the RapidsDB MPP Execution Engine, pulling up from the underlying data sources only the data that is needed to complete the execution of the query.

1.2.2 MPP Execution Engine

RapidsDB has its own fully parallel MPP Execution Engine that is responsible for executing the query plan generated by the RapidsDB SQL Compiler & Optimizer. The MPP Execution Engine accesses the underlying data sources through the Federated Connectors.

1.2.3 Federated Connectors

RapidsDB supports a plug-in Data Connector technology for interfacing with the underlying data sources. The Connectors provide a standard ANSI 3-part naming interface (catalog.schema.table) to the data that is managed by the data sources. For data sources that do not support ANSI 3-part naming, the Connector provides the missing parts of the 3-part name. For example, for a streaming data source the Stream Connector provides a catalog and schema name. During query execution, the Connectors are responsible for executing those parts of the plan that the Optimizer has generated for the associated data store, and for passing the results of that execution up to the RapidsDB MPP Execution Engine.

1.2.4 RapidsSE

RapidsSE is an in-memory storage engine that is tightly integrated with the RapidsDB Execution Engine. The data managed by RapidsSE is stored in a shared memory segment that can be accessed directly by the RapidsDB Execution Engine. For the Beta release, RapidsSE can run on a single node in the RapidsDB cluster. At this time, RapidsSE has no query processing capabilities of its own: all query execution is handled by the RapidsDB Execution Engine, with RapidsSE purely providing in-memory storage for the data (in a CSV format). RapidsDB plays a role similar to Hive accessing data stored in HDFS files, the difference being that with RapidsSE the data is stored in memory.

1.2.5 Client API

RapidsDB provides a web-based management console, the RapidsDB Manager, for configuring and managing the RapidsDB cluster. RapidsDB also provides a command line interface via the rapids-shell, as well as a programmatic interface for Java-based clients. The next release will add a JDBC interface and an SDK that will allow third parties and customers to build native interfaces for other languages, such as C++, Python, etc.

1.2.6 Zookeeper

RapidsDB uses Zookeeper for configuration management across the RapidsDB cluster.

1.3 RapidsDB Cluster Topology

The diagram below shows the topology of a RapidsDB Cluster:

- rapids-shell - provides the command line interface for sending queries to the RapidsDB Cluster.
- DQC (Distributed Query Coordinator) - within the RapidsDB Cluster there is one node that must be designated as the DQC. This is the node that the rapids-shell sends queries to. The DQC node can also act as a DQE node (see below).
- DQE (Distributed Query Executor) - the remaining nodes in the RapidsDB Cluster perform the query execution and are used to connect to remote data sources via the RapidsDB Connectors (see section 2 for more details on Connectors).

2. Federations, Connectors and Naming

2.1 Overview

In RapidsDB, a Federation is a logical grouping of zero or more Connectors. Federations are named, and RapidsDB has a default Federation named DEFAULTFED. From the rapids-shell, the user can specify the Federation to be used by prefixing any query with the string /*+ FEDERATION = <Federation name> */, for example:

rapids > /*+ FEDERATION = Production */ select * from customer;

This query would return all of the rows from a table named CUSTOMER from one of the Connectors in the Production Federation. A query can only reference one Federation at a time. If no Federation name is specified, then the default Federation named DEFAULTFED is used. Because a Federation specifies a set of Connectors, the user is limited to querying the tables that are accessible from the set of Connectors defined for the specified Federation. The diagram below is a simplified view of the default Federation with 3 Connectors, each Connector providing access to a set of tables:

Using the default Federation, the user would be able to specify a query that accesses one or more of the tables shown. For example:

SELECT COUNT(*) FROM VOLTDB.PUBLIC.EMPS;

2.2 Connectors

A RapidsDB Connector provides the interface to an underlying data store. The Connector is responsible for maintaining the metadata about all of the tables that can be accessed from the underlying data store. A Connector is also responsible for executing the portions of the RapidsDB query execution plan that access tables managed by the underlying data store.

2.3 Table Naming

All tables in RapidsDB are identified using an ANSI standard 3-part name of <catalog>.<schema>.<table>. The 3-part name has to be unique within a Federation; if the name is ambiguous, an error is returned when the name is referenced in a query. Each Connector is responsible for managing its own 3-part name space. Where the underlying data source supports 3-part names, the Connector uses that 3-part name to identify each table. Where the underlying data source does not support a catalog and/or schema name, the Connector provides the missing part of the name. The table below shows how the catalog names and schema names are assigned for the different Connectors supported in Release 3.0 Beta:

Connector Type   Catalog Name   Schema Name
RapidsSE         RAPIDSSE       Schema name (user defined)
MemSQL           MEMSQL         MemSQL database name
VoltDB           VOLTDB         PUBLIC
Stream           STREAM         PUBLIC

RapidsDB converts all names to upper case unless the name is enclosed in double quotes. For some data sources, such as MemSQL, the table name is case-sensitive, and for those data sources the table name must be enclosed within double quotes when being referenced in a query sent to RapidsDB, unless the source table name is already all upper case. The catalog and schema names are case insensitive when specified in a query sent to RapidsDB. Below are some example queries using the tables shown in the diagram above:

select * from "Hist";
select * from prod."Hist";
select * from PROD."Hist";
select * from voltdb.public.emps;

2.4 Name Resolution

When resolving a table name in a query, RapidsDB checks with each Connector in the current Federation to see if that Connector has access to the specified table. Using the sample diagram above, the following queries would all resolve to the Hist table accessed through the MemSQL Connector:

select * from "Hist";
select * from prod."Hist";
select * from memsql.prod."Hist";

When multiple Connectors have access to the same table name, the user must qualify the table sufficiently to disambiguate the name. In the example above, the ORDERS table is accessible from both the VoltDB and the Stream Connectors, so a query such as select * from orders; would return an error because both the VoltDB and Stream Connectors can access a table named ORDERS. To disambiguate the table name, the user would have to use one of the following:

select * from voltdb.public.orders where ...   -> resolves to the VoltDB Connector
select * from stream.public.orders where ...   -> resolves to the Stream Connector

2.5 Importing Tables

When configuring Connectors (see sections 6.4 and 8.9), the user can specify which tables should be imported and made available to the user for querying. The term importing in this context means getting the metadata associated with a table from the associated data source and including that table name (including the catalog and schema names) in the table name space managed by the Connector for the specified Federation. The Import operation only applies to metadata; no actual table data will get imported. The table data will be accessed as part of a query that references that table. By default, a Connector will import all of the tables that can be accessed from the underlying data source. For example, both VoltDB and MemSQL provide access to multiple tables, and by default, all of the tables will be imported. The user can restrict which tables will be imported when configuring a Connector by explicitly listing the catalogs, schemas and table names to be imported (see sections 6.4 and 8.9).

2.6 Mapping Catalog, Schema and Table Names

When configuring a Connector, the user can map one or more parts of the 3-part name to local names that are used when querying through RapidsDB. The Connector maps these local names back to the original names used by the underlying data source. The diagram below shows two MemSQL Connectors, each with the table CUSTOMERS, but in different schemas named EAST and WEST:

The query SELECT * FROM customers would fail with an error indicating that the table name was ambiguous. To disambiguate the name, the query would have to be changed to SELECT * FROM east.customers to access the CUSTOMERS table managed by the Connector named MemSQL Connector1. When configuring the MemSQL Connectors, the user could map the table named EAST.CUSTOMERS to EASTCUSTOMERS and the table named WEST.CUSTOMERS to WESTCUSTOMERS; the table names would then be unique (for each Connector) and would not have to be qualified using the schema name, e.g. SELECT * FROM eastcustomers WHERE ...

Refer to the Installation and Management Guide, sections 6.5 and 8.9, for details on how to do name mapping.
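Putting the pieces of this chapter together, a single query can join tables that live behind different Connectors in the same Federation by using their qualified names. The following is an illustrative sketch only: the EMPS and Hist tables are taken from the diagrams above, but the join columns (id and emp_id) are hypothetical.

SELECT e.*, h.*
FROM voltdb.public.emps e
JOIN memsql.prod."Hist" h ON h.emp_id = e.id;

Note the double quotes around the MemSQL table name, as required for case-sensitive source table names (see section 2.3).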

3. Query Interfaces

3.1 RapidsDB shell

3.1.1 Invoking the RapidsDB shell

The RapidsDB shell can be accessed from the RapidsDB installation directory (refer to your system administrator for the location of this directory) by executing the rapids-shell.sh script file. In the following example the installation directory is /opt/rdp/current:

1. cd /opt/rdp/current
2. ./rapids-shell.sh
3. Queries can then be submitted (for example, see the sample query shown after this list).
4. The formatting will display the heading information every 25 lines and will align the data accordingly.
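For instance, a query such as the following could be entered at the prompt (the table name is taken from the examples in section 2; the result display is not reproduced here):

rapids > SELECT COUNT(*) FROM VOLTDB.PUBLIC.EMPS;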

3.1.2 RapidsDB shell Syntax

The RapidsDB shell syntax is:

[repeat_count] [ EXPLAIN [ALL] | STATS ] <SQL statement>

- repeat_count is an integer value. If present, the statement is repeated the number of times specified. Specifying a repeat count causes the query output to be suppressed. Instead, for each repetition the RapidsDB shell outputs the number of rows produced and the elapsed time. It should not be used with the EXPLAIN option.
- EXPLAIN [ALL], if present, causes the RapidsDB shell to output a schematic representation of the query plan for the associated statement, including the SQL queries that will be sent to underlying Data Stores and the internal operators and routing for operations that will be performed by DQS. By default, the EXPLAIN output will show the first, second and last branches of a tree that are otherwise identical except for the partitions they run on. You can also supply the keyword ALL to the EXPLAIN command so that it does not truncate the explain output, e.g., EXPLAIN ALL SELECT ...
- STATS, if present, causes the RapidsDB shell to display detailed execution statistics for the query, using the same format as generated by EXPLAIN.

Each of the options is described in more detail in the following sections.

3.1.3 RapidsDB shell: EXPLAIN

EXPLAIN instructs the RapidsDB shell to output a schematic representation of the query plan for the associated statement, including the SQL queries that will be sent to the underlying Data Store and the internal operators and routing for operations that will be performed by DQS. The query plan returned by EXPLAIN is static and does not contain any dynamic changes that may occur in the plan as it executes (e.g., Bloom Filters are generated and added into the plan dynamically based on data returned from the query, hence they do not get shown in EXPLAIN output). In the following examples, the text in the greyed boxes is explanatory text.

Simple SELECT

The following example is a simple select against a replicated table, where the entire query will get executed by a single DQE and passed down to the underlying Data Store for execution. In the plan output, the Condenser shows the query that will be passed down and executed on the Data Store.
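Such an EXPLAIN request would be entered in the shell as, for instance (the table name is illustrative only):

EXPLAIN SELECT * FROM voltdb.public.emps;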

Aggregate Query

The next example shows an aggregate query against a partitioned table, where the table is partitioned across two nodes with 6 partitions per node. The annotations on the plan output read as follows:

- Only project the summation column and eliminate the O_CUSTKEY column.
- Compute final aggregates from the results provided by the two nodes.
- 2 sub-plans will be generated, one for each node. The results from each node will be merged.
- Compute final aggregates for the first node (ip addr ...). At the node level there will be one plan for each local partition, for a total of 6 sub-plans. The aggregates from each partition will then be merged.
- Aggregate query executed against each partition on the data store.
- Plan for the second node (ip addr ...), which is a repeat of the first node.

1-way JOIN

The following example is for a more complex query involving a one-way join across two tables that are partitioned across two nodes with 6 partitions per node. This example uses a Bloom Join (see 6.2.1).

The annotations on the plan output read as follows:

- 2 sub-plans will be generated, one for each node. The results from each node will be merged.
- Plan for the first node.
- Eliminate columns not selected in the query.
- This represents the left-hand table of the join (ORDERS). Here we get the rows from the left-hand table from all of the partitions on this node to form the Bloom Filter that will be applied to rows from the right-side table.
- This represents the right-hand table of the join (LINEITEM).
- Merge the results from the Bloom Filter applied to the right-hand table (LINEITEM) on the 2 nodes.
- Apply the Bloom Filter against each partition of the LINEITEM table on the first node.
- Apply the Bloom Filter against each partition of the LINEITEM table on the second node.
- Plan for the second node (repeat of plan for first node).

3.1.4 RapidsDB shell: STATS

The STATS option causes the RapidsDB shell to display detailed execution statistics for the query, and should be used with the repeat_count (see above) set to a value of one so that the output from the query to the screen is suppressed (to remove the overhead associated with displaying the full set of results). This option is very useful for understanding the performance of a query. It also shows plan elements that were added to the plan dynamically and executed (e.g., Bloom Filters are dependent on data being read by the query, so they are dynamically generated, added to the plan and executed; they do not get shown in EXPLAIN output, but they will be shown in STATS output since they were executed). The statistics returned are:

- Row Count - the number of rows passed up to the next level of the plan
- Bytes Serialized, Bytes Deserialized
- Site: Node xxx.xxx.xxx.xxx:yyy - the IP address and port number for the Data Store node
- Site: Node xxx.xxx.xxx.xxx:yyy Partition <n> - the statistics for the specified partition number

For a Bloom Join (see 6.2.1) the following additional statistics are returned:

o False positive rows - the number of rows returned from the right-side table after applying the Bloom Filter that do not match the join condition. These are false positives from the Bloom Filter.
o Bloom Filter Expected Inserts - the size of the Bloom Filter that will be executed on the right side of the plan. It is an estimate of the number of unique entries that RapidsDB anticipates will be checked against the Bloom Filter from the right-side table. Theoretically, the optimal size of the Bloom Filter would be the maximum of the number of unique values on the left and right side tables. RapidsDB is already reading the left table, so RapidsDB can work out how many unique values there are there. However, RapidsDB does not know ahead of time how many unique values there are on the right-hand side, so RapidsDB uses the unique count from the left side as an estimate. If the unique count from the left side is below a minimum value (currently set to 10,240), then RapidsDB will set the size of the Bloom Filter to that minimum.
o Desired FPP - the False Positive Probability. This is the predicted ratio of false positives after applying the Bloom Filter, expressed as a percentage. E.g., a 1% FPP indicates that the Bloom Filter should mistakenly return 1% of rows that do not match the join condition. This is a goal only, and the actual number of false positive rows returned from the Bloom Filter depends on the Bloom Filter size (estimated from reading the left-hand table) and the number of unique join values from all the rows on the right-hand table. The latter is not known ahead of time, so the actual number of false positive rows returned will vary with each query.

The statistics will only be displayed for one of the partitions from a Data Store node.

Simple SELECT

The following is the STATS output for the simple SELECT example described in section 3.1.3.
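As described above, a STATS request is typically submitted with a repeat count of one so that the result rows themselves are suppressed, for example (the query is illustrative only):

1 STATS SELECT * FROM voltdb.public.emps;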

Aggregate Query

The following is the STATS output for the aggregate query example described in section 3.1.3. The annotations on the output read as follows:

- Return 75 rows for the query result set.
- Return a total of 148 rows from both nodes.
- Node 1: Compute aggregates from all partitions and return a total of 73 rows.
- Node 2: Compute aggregates from all partitions and return a total of 75 rows.

1-way JOIN

The following is the STATS output for the 1-way JOIN example described in section 3.1.3. The annotations on the output read as follows:

- Merge of the final results from the two nodes, where 12,000 rows were returned from the 2 nodes.
- Node 1: returned 5,543 rows from the JOIN of the 11,322 rows from LINEITEM with the ORDERS table.
- Node 1: Bloom Filter has 55,005 unique entries (out of 85,363) from the ORDERS table.
- Node 1: 85,363 rows from the ORDERS table (across 6 partitions).
- Total of 11,322 rows returned from 2 nodes from the LINEITEM table after applying the Bloom Filter.
- Node 2: 5,660 rows returned from the LINEITEM table after applying the Bloom Filter.
- Node 2: For the first partition, the Bloom Filter returned 915 rows and filtered out 68,520.
- Node 1: 5,662 rows from the LINEITEM table after applying the Bloom Filter.

- Node 2: returned 6,457 rows from the JOIN of 13,857 rows from LINEITEM with ORDERS.
- Node 2: Bloom Filter has 55,161 unique entries (out of 85,363) from the ORDERS table.
- Node 2: 85,574 rows from the ORDERS table (across 6 partitions).
- Total of 13,857 rows returned from 2 nodes from the LINEITEM table after applying the Bloom Filter.
- Node 2: 6,899 rows from the LINEITEM table after applying the Bloom Filter.
- Node 1: 6,958 rows from the LINEITEM table after applying the Bloom Filter.

3.2 Programmatic API

3.2.1 RapidsDB shell Scripting Mode

The RapidsDB shell provides a scripting option to facilitate the submission of queries programmatically. To run the RapidsDB shell in scripting mode, the following two arguments must be passed to the RapidsDB shell: -s -c <path to cluster.config file> (see below for more details on invoking the RapidsDB shell programmatically). There is an optional third parameter, -t, which when specified will result in the second record of the query result containing a comma-separated list of the data types for each column returned (see 3 below). When run in scripting mode, the input/output is optimized for being parsed, not for human readability. The following describes the behavior of the RapidsDB shell when run in scripting mode:

1. The RapidsDB shell will execute the supplied query and then terminate after the results have been sent back to the client.
2. The query results will be returned on stdout in the format of a CSV file, where the first record returned will be a comma-separated list of the column names from the query select list. If the -t option was specified when starting up the rapids-shell, then the second record will contain a comma-separated list of the data types for each column returned. The data types will be one of the following: BOOLEAN, TINYINT, SMALLINT, INTEGER, BIGINT, FLOAT, DECIMAL, TIMESTAMP, INTERVALDT (day-time interval), INTERVALYM (year-month interval), VARCHAR, VARBINARY, NULL (for the case of selecting literal nulls). The query results will then follow as comma-separated sets of values that correspond to the columns in the select list.
3. To aid in the processing of the results:

- String column values are surrounded by double quotes. Double quote characters within a string are escaped by preceding them with an additional double quote.
- Timestamp fields are encoded as: TIMESTAMP yyyy-mm-dd hh:mm:ss.ffff.

4. Every other data type has no special encoding.
5. Any errors will be reported on stderr.
6. When the query execution has completed, the RapidsDB shell will terminate with either a zero completion code to indicate that the query completed successfully, or a non-zero error code when an error has occurred.

Example: The following example shows the output generated when using the -t option to include the data types for the returned columns:

$ echo "select * from columns where table_name = 'TABLES';" | ./rapids-shell.sh -s -t -c cluster.config
"CATALOG_NAME","SCHEMA_NAME","TABLE_NAME","COLUMN_NAME","DATA_TYPE","ORDINAL","IS_PARTITION_KEY"
"VARCHAR","VARCHAR","VARCHAR","VARCHAR","VARCHAR","BIGINT","BOOLEAN"
"RAPIDS","SYSTEM","TABLES","CATALOG_NAME","VARCHAR",1,false
"RAPIDS","SYSTEM","TABLES","SCHEMA_NAME","VARCHAR",2,false
"RAPIDS","SYSTEM","TABLES","TABLE_NAME","VARCHAR",3,false
"RAPIDS","SYSTEM","TABLES","IS_PARTITIONED","BOOLEAN",4,false

The following example shows the output generated without using the -t option:

$ echo "select * from columns where table_name = 'TABLES';" | ./rapids-shell.sh -s -c cluster.config
"CATALOG_NAME","SCHEMA_NAME","TABLE_NAME","COLUMN_NAME","DATA_TYPE","ORDINAL","IS_PARTITION_KEY"
"RAPIDS","SYSTEM","TABLES","CATALOG_NAME","VARCHAR",1,false
"RAPIDS","SYSTEM","TABLES","SCHEMA_NAME","VARCHAR",2,false
"RAPIDS","SYSTEM","TABLES","TABLE_NAME","VARCHAR",3,false
"RAPIDS","SYSTEM","TABLES","IS_PARTITIONED","BOOLEAN",4,false

3.2.2 Invoking the RapidsDB shell Programmatically from Java

The following describes how to invoke the RapidsDB shell programmatically from Java:

1. Invoke the RapidsDB shell as an external process with the following two arguments: -s -c <path to cluster.config file>. For example, to run the RapidsDB shell directly in scripting mode:

/opt/rdp/current/rapids-shell.sh -s -c /opt/rdp/current/cfg/cluster.config

2. Read from the stdout and stderr of the process until the process has exited and the input streams from the process's stdout and stderr have been drained.

3. To demonstrate how to invoke the RapidsDB shell from within a Java program, a sample program named Scripter has been included in Appendix A of this manual. The Scripter program accepts one argument, which is the command used to invoke the RapidsDB shell. For example, if you are running the Scripter program from the /opt/rdp/current directory:

java -cp .:./bin/*:./lib/* com.rapidsdata.cli.Scripter "./rapids-shell.sh -s -c cfg/cluster.config"

Note the double quotes around the command to run the RapidsDB shell. The Scripter program expects the first argument to be the complete command to run the RapidsDB shell. If you are not running from the "current" directory in the RDP installation, for example, if you are making your own version of Scripter, then the classpaths need to be modified in order to find all the dependencies. Assuming that a shell variable points to the RapidsDB installation directory, for example, RDP_DIR=/opt/rdp/current, then the following command could be used to launch the application:

java -cp .:$RDP_DIR:$RDP_DIR/bin/*:$RDP_DIR/lib/* com.rapidsdata.cli.Scripter "java -cp $RDP_DIR:$RDP_DIR/bin/*:$RDP_DIR/lib/* com.rapidsdata.cli.cli -s -c $RDP_DIR/cfg/cluster.config"

4. SQL Syntax

This section describes the SQL syntax supported by RapidsDB.

4.1 Lexical Structure

A SQL command is composed of a sequence of tokens, terminated by a semicolon (";"). Which tokens are valid depends on the syntax of the particular command. A token can be a keyword, an identifier, a quoted identifier, a literal (or constant), or a special character symbol. Additionally, comments can occur in SQL input. Comments are not tokens; they are effectively equivalent to whitespace. Here is an example of a syntactically valid SQL command:

SELECT * FROM CUSTOMER WHERE CUSTOMER_NAME = 'Smith';

Refer to Appendix B for details of the SQL grammar supported by the RapidsDB DQS.

4.1.1 Identifiers and Keywords

Tokens such as SELECT or WHERE are examples of keywords, that is, words that have a fixed meaning in the SQL language. In the example query above, the tokens CUSTOMER and CUSTOMER_NAME are examples of identifiers. They identify names of tables, columns, or other database objects, depending on the command they are used in. Therefore they are sometimes simply called "names". Keywords and identifiers have the same lexical structure, meaning that one cannot know whether a token is an identifier or a keyword without knowing the language.

SQL identifiers and keywords must begin with a letter (a-z, but also letters with diacritical marks and non-Latin letters) or an underscore (_). Subsequent characters in an identifier or keyword can be letters, underscores, or digits (0-9). The maximum length of an identifier depends on the underlying Data Store. Keywords and unquoted identifiers are case insensitive. Therefore:

SELECT * FROM CUSTOMER WHERE CUSTOMER_NAME = 'Smith';

can equivalently be written as:

Select * FroM customer Where Customer_name = 'Smith';

A convention often used is to write keywords in upper case and names in lower case, e.g.:

SELECT * FROM customer WHERE customer_name = 'Smith';

There is a second kind of identifier: the delimited identifier or quoted identifier. It is formed by enclosing an arbitrary sequence of characters in double quotes ("). A delimited identifier is always an identifier, never a keyword. So "select" could be used to refer to a column or table named "select", whereas an unquoted select would be taken as a keyword and would therefore provoke a parse error when used where a table or column name is expected. The example can be written with quoted identifiers like this:

SELECT * FROM "CUSTOMER" WHERE "CUSTOMER_NAME" = 'Smith';

Quoted identifiers can contain any character, except the character with code zero. (To include a double quote, write two double quotes.) This allows constructing table or column names that would otherwise not be possible, such as ones containing spaces or ampersands. The length limitation still applies. Quoted identifiers are case sensitive.

NOTE: Support for quoted identifiers is dependent on the underlying Data Store also supporting quoted identifiers.

4.1.2 Constants

There are two kinds of implicitly-typed constants: strings and numbers. Constants can also be specified with explicit types. These alternatives are discussed in the following subsections.

String Constants

A string constant is an arbitrary sequence of characters bounded by single quotes ('), for example 'This is a string'. To include a single-quote character within a string constant, write two adjacent single quotes, e.g., 'Dianne''s horse'. Note that this is not the same as a double-quote character (").

Numeric Constants

Numeric constants are accepted in these general forms:

digits
digits.[digits][e[+-]digits]
[digits].digits[e[+-]digits]
digitse[+-]digits

where digits is one or more decimal digits (0 through 9). At least one digit must be before or after the decimal point, if one is used. At least one digit must follow the exponent marker (e), if one is present. There cannot be any spaces or other characters embedded in the constant. Note that any leading plus or minus sign is not actually considered part of the constant; it is an operator applied to the constant. These are some examples of valid numeric constants: 42, 3.5, .001, 5e2, 1.925e-3.

A numeric constant that contains neither a decimal point nor an exponent is initially presumed to be of type integer. Constants that contain decimal points and/or exponents are always initially presumed to be of type numeric.

4.1.3 Operators

The following operators are supported:

'+'  '-'  '*'  '/'  '%'  '<'  '<='  '='  '!='  '<>'  '>='  '>'

4.1.4 Special Characters

Some characters that are not alphanumeric have a special meaning that is different from being an operator. Details on their usage can be found at the location where the respective syntax element is described. This section only notes the existence of these characters and summarizes their purposes.

- Parentheses (()) have their usual meaning to group expressions and enforce precedence. In some cases parentheses are required as part of the fixed syntax of a particular SQL command.
- Commas (,) are used in some syntactical constructs to separate the elements of a list.
- The semicolon (;) terminates an SQL command. It cannot appear anywhere within a command, except within a string constant or quoted identifier.
- The asterisk (*) is used in some contexts to denote all the fields of a table row or composite value.
- The period (.) is used in numeric constants, and to separate schema, table, and column names.

4.1.5 Comments

C-style block comments can be used, where the comment begins with /* and extends to the matching occurrence of */. These block comments cannot nest. A comment is removed from the input stream before further syntax analysis and is effectively replaced by whitespace. Example:

SELECT /* Customer query */ * FROM customer /* Source table */ WHERE customer_name = 'Smith';

4.1.6 Operator Precedence

The table below shows the operator precedence rules, from highest to lowest:

Operator                  Associativity   Description
.                         Left            Table/column name separator
+ -                       Right           Unary plus, unary minus
* / %                     Left            Multiplication, division, modulo
+ -                       Left            Addition, subtraction
BETWEEN IN LIKE
< > = <= >= <>                            Comparison operators
IS NULL, IS NOT NULL
NOT                       Right
AND                       Left
OR                        Left
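For example, because multiplication binds more tightly than addition, parentheses are needed to force the addition to be evaluated first:

SELECT 2 + 3 * 4;    /* evaluates to 14 */
SELECT (2 + 3) * 4;  /* evaluates to 20 */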

4.2 Data Types and Type Specifiers

Data Types

RapidsDB supports the following data types: INTEGER, DECIMAL, FLOAT, TIMESTAMP, VARCHAR, VARBINARY.

Type Specifiers

A RapidsDB type specifier specifies a data type and desired precision. Type specifiers are used in the CAST operator (see 4.3.5), in the column definitions of CREATE TABLE statements (see 9.1), and in the USING clause of the CREATE CONNECTOR statement (see the Installation and Management Guide). The interpretation of size, precision and scale values in a RapidsDB type specifier depends on the data type. In this release the interpretations are as follows:

Type        Default interpretation of size / scale / precision
INTEGER     Precision in decimal digits (if unspecified: 18)
DECIMAL     Precision in decimal digits, scale in decimal digits (if unspecified: 19, 0)
FLOAT       Size of mantissa in binary digits (if unspecified: 53)
VARCHAR     Physical size in bytes (if unspecified: variable up to 2G)
VARBINARY   Physical size in bytes (if unspecified: variable up to 2G)

NOTES:

1. The value for precision of a FLOAT is interpreted as the number of binary digits in the mantissa. This is per ANSI SQL. A 53-bit mantissa corresponds to a standard 64-bit IEEE double precision floating point value. Expressed in decimal, the precision is approximately 16 digits.

Use in CAST

In principle, CAST should precisely convert a value to the specified data type and precision, or generate an error if this is not possible. So, for example, CAST(myColumn AS INTEGER(3)) should produce a value between -999 and +999 or generate an overflow exception if the value of myColumn lies outside that range. In practice, the behavior may deviate in certain cases, depending on whether the CAST operation is pushed down to the underlying data store (in particular, some data stores fail to generate overflow exceptions where expected).

Use in Column Definitions

When used in a column definition (CREATE TABLE or CREATE CONNECTOR WITH TABLE USING), a type specifier specifies the minimum requirement for a column. The underlying system may optionally substitute a definition of greater size or precision. It may not, however, substitute a definition of lesser size or precision. So, for example, INTEGER(4) specifies a column that can store values in the range -9999 to +9999 (and obeys the rules for integer math). An underlying data store may create the corresponding physical database column as, for example, a 16-bit integer. If a column definition specifies a capability that exceeds the maximum capability of either the RapidsDB Execution Engine or an underlying Data Store, the definition will be rejected. For example, INTEGER(22) will be rejected by RapidsDB because the maximum precision for integers in RapidsDB is 19 decimal digits. DECIMAL(20,15) will be rejected by the VoltDB connector because the maximum number of fractional digits for decimal values in VoltDB is 12.

System Metadata

Column information in the COLUMNS table in the RapidsDB System Metadata (see 11.8) reflects the type, size, precision and scale of columns as reported by the underlying system and interpreted by RapidsDB. The information may differ from the definitions used to create the tables. The column data types are shown as standardized RapidsDB types. Size, precision and scale may exceed the values specified in a RapidsDB CREATE TABLE statement, as noted above.

Internal Precision

The type specifiers affect storage, reading, writing and conversion of data values but do not control the precision of calculations on those values during query execution. Query calculations are performed by the RapidsDB Execution Engine (or the query engines of underlying Data Stores) with precision no less than the following:

- For integer values, 64-bit signed integers.
- For floating point values, 64-bit double precision (Java IEEE 754).
- For character values, Java String values of up to 2GB (using UTF-16 encoding).
- For binary values, Java byte arrays of up to 2GB in size.
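As an illustration of type specifiers in column definitions, the following is a minimal sketch; the table and column names are hypothetical, and the full CREATE TABLE syntax is described in section 9.1:

CREATE TABLE sales_summary (
    region_id   INTEGER(9),
    region_name VARCHAR(100),
    total_sales DECIMAL(12,4)
);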

4.3 Value Expressions

Value expressions are used in a variety of contexts, such as in the target list of the SELECT command or in search conditions. The result of a value expression is sometimes called a scalar, to distinguish it from the result of a table expression (which is a table). Value expressions are therefore also called scalar expressions (or even simply expressions). The expression syntax allows the calculation of values from primitive parts using arithmetic, logical, set, and other operations. A value expression is one of the following:

- A constant or literal value
- A column reference
- An operator invocation
- A function call
- An aggregate expression
- A type cast
- A scalar subquery
- Another value expression in parentheses (used to group subexpressions and override precedence)

In addition to this list, there are a number of constructs that can be classified as an expression but do not follow any general syntax rules. These generally have the semantics of a function or operator and are explained in the appropriate location in chapter 6. An example is the IS NULL clause. We have already discussed constants in section 4.1.2. The following sections discuss the remaining options.

4.3.1 Column References

A column can be referenced in the form:

correlation.columnname

correlation is the name of a table, or an alias for a table defined by means of a FROM clause. The correlation name and separating dot can be omitted if the column name is unique across all the tables being used in the current query.

4.3.2 Operator Invocation

There are three possible syntaxes for an operator invocation:

expression operator expression   (binary infix operator)
operator expression              (unary prefix operator)
expression operator              (unary postfix operator)

where the operator token follows the syntax rules of section 4.1.3, or is one of the keywords AND, OR, and NOT.
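For example (the column names are hypothetical):

price * quantity        /* binary infix operator */
-balance                /* unary prefix operator */
NOT (price > 100)       /* keyword operator */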

4.3.3 Function Call

The syntax for a function call is the name of a function followed by its argument list enclosed in parentheses:

function_name ( [expression [, expression ... ]] )

For example, the following computes the maximum value of column c1:

max(c1)

The list of built-in functions is in chapter 6, Functions and Operators.

4.3.4 Aggregate Expression

An aggregate expression represents the application of an aggregate function across the rows selected by a query. An aggregate function reduces multiple inputs to a single output value, such as the sum or average of the inputs. The syntax of an aggregate expression is one of the following:

aggregate_name (expression [, ... ])
aggregate_name (ALL expression [, ... ])
aggregate_name (DISTINCT expression [, ... ])

where aggregate_name is a system-defined aggregate and expression is any value expression that does not itself contain an aggregate. The first form of aggregate expression invokes the aggregate once for each input row. The second form is the same as the first, since ALL is the default. The third form invokes the aggregate once for each distinct value of the expression (or distinct set of values, for multiple expressions) found in the input rows. Most aggregate functions ignore null inputs, so that rows in which one or more of the expression(s) yield null are discarded. For example, count(*) yields the total number of input rows; count(f1) yields the number of input rows in which f1 is non-null, since count ignores nulls; and count(distinct f1) yields the number of distinct non-null values of f1.

4.3.5 Type Cast

A type cast specifies a conversion from one data type to another:

CAST ( expression AS type )

When a cast is applied to a value expression of a known type, it represents a run-time type conversion. The cast will succeed only if a suitable type conversion operation has been defined. A cast applied to an unadorned string literal represents the initial assignment of a type to a literal constant value, and so it will succeed for any type (if the contents of the string literal are acceptable input syntax for the data type).
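For example, the first cast below assigns the TIMESTAMP type to a string literal, while the second converts a column value at run time (the column name is hypothetical; the literal follows the yyyy-mm-dd hh:mm:ss form used elsewhere in this guide):

CAST('2016-01-01 12:30:00' AS TIMESTAMP)
CAST(unit_price AS DECIMAL(10,2))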

The table below shows the supported cast operations (rows are the source type, columns are the target type):

Source \ Target  INTEGER  DECIMAL  FLOAT  STRING  TIMESTAMP  BOOLEAN  BINARY  NULL
INTEGER          Y        Y        Y      Y       N          N        N       N
DECIMAL          Y        Y        Y      Y       N          N        N       N
FLOAT            Y        Y        Y      Y       N          N        N       N
STRING           Y        Y        Y      Y       Y          Y        N       N
TIMESTAMP        N        N        N      Y       Y          N        N       N
BOOLEAN          N        N        N      Y       N          Y        N       N
BINARY           N        N        N      N       N          N        Y       N
NULL             Y        Y        Y      Y       Y          Y        Y       Y

When casting to the FLOAT type, the precision will be to one digit. When casting to a DECIMAL type, the precision will be to 9 digits.

4.3.6 Decimal Expressions and Precision

Decimal expressions are mathematical expressions with data of type DECIMAL. Decimal values occur either because a column is defined as type DECIMAL or because a value is converted to DECIMAL using the CAST operator. A decimal value has a precision, which is the total number of significant digits in the value, and a scale, which is the number of digits to the right of the decimal point. When the RapidsDB Query Planner analyzes a decimal expression, it determines a "canonical" precision and scale for each operator in the expression. When the query is executed, these values control the precision and scale of the decimal calculations. Generally, the canonical precision and scale are designed to preserve sufficient decimal places to fully represent the possible result of the calculation. The following table summarizes the rules used. p1 and s1 represent the precision and scale of the first operand of a math operator; p2 and s2 represent the precision and scale of the second operand.

Operation   Canonical Precision and Scale
+ or -      Scale = max(s1, s2); Precision = max(p1 - s1, p2 - s2) + scale
*           Scale = s1 + s2; Precision = p1 + p2 + 1
/           Scale = max(4, s1 + p2 + 1); Precision = p1 - s1 + s2 + scale
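As a worked illustration of these rules, consider two operands of types DECIMAL(10,2) and DECIMAL(8,3):

Multiplication: scale = 2 + 3 = 5; precision = 10 + 8 + 1 = 19                ->  DECIMAL(19,5)
Division:       scale = max(4, 2 + 8 + 1) = 11; precision = 10 - 2 + 3 + 11 = 22  ->  DECIMAL(22,11)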

Note that in all cases, the maximum precision of a decimal calculation is 38 digits and the maximum scale is 38 digits. During execution of a query, if a calculation produces a result whose precision would exceed 38 digits, the scale is reduced to preserve the integral part of the result. The precision and scale of a decimal result can be controlled explicitly using the CAST operator (see 4.3.5).

4.3.7 Scalar Subquery

A scalar subquery is an ordinary SELECT query in parentheses that returns exactly one row with one column. (See chapter 5 for information about writing queries.) The SELECT query is executed and the single returned value is used in the surrounding value expression. It is an error to use a query that returns more than one row or more than one column as a scalar subquery. (But if, during a particular execution, the subquery returns no rows, there is no error; the scalar result is taken to be null.) The subquery can refer to variables from the surrounding query, which will act as constants during any one evaluation of the subquery. For example, the following finds the largest city population in each state:

SELECT name, (SELECT max(pop) FROM cities WHERE cities.state = states.name) FROM states;

4.3.8 Expression Evaluation Rules

The order of evaluation of subexpressions is not defined. In particular, the inputs of an operator or function are not necessarily evaluated left-to-right or in any other fixed order. Furthermore, if the result of an expression can be determined by evaluating only some parts of it, then other subexpressions might not be evaluated at all. For instance, if one wrote:

SELECT true OR somefunc();

then somefunc() would (probably) not be called at all. As a consequence, it is unwise to use functions with side effects as part of complex expressions. It is particularly dangerous to rely on side effects or evaluation order in WHERE and HAVING clauses, since those clauses are extensively reprocessed as part of developing an execution plan. When it is essential to force evaluation order, a CASE construct (see 5.7.1) can be used. For example, this is an untrustworthy way of trying to avoid division by zero in a WHERE clause:

SELECT ... WHERE x > 0 AND y/x > 1.5;

But this is safe:

SELECT ... WHERE CASE WHEN x > 0 THEN y/x > 1.5 ELSE false END;

There are situations where a CASE statement will not always prevent problems. For example, a CASE cannot prevent evaluation of an aggregate expression contained within it, because aggregate expressions are computed before other expressions in a SELECT list or HAVING clause are considered.

For example, the following query can cause a division-by-zero error despite seemingly having protected against it:

SELECT CASE WHEN min(employees) > 0 THEN avg(expenses / employees) END FROM departments;

The min() and avg() aggregates are computed concurrently over all the input rows, so if any row has employees equal to zero, the division-by-zero error will occur before there is any opportunity to test the result of min(). Instead, use a WHERE clause to prevent problematic input rows from reaching an aggregate function in the first place.

5. Queries

5.1 Overview

The general syntax of the SELECT command is:

SELECT select_list FROM table_expression [sort_specification]

The following sections describe the details of the select list, the table expression, and the sort specification. A simple kind of query has the form:

SELECT * FROM table1;

The select list specification * means all columns that the table expression happens to provide. A select list can also select a subset of the available columns or make calculations using the columns. For example, if table1 has columns named a, b, and c you can make the following query:

SELECT a, b + c FROM table1;

(assuming that b and c are of a numerical data type). See Section 4.3 for more details.

FROM table1 is a simple kind of table expression: it reads just one table. In general, table expressions can be complex constructs of base tables, joins, and subqueries. But you can also omit the table expression entirely and use the SELECT command as a calculator:

SELECT 3 * 4;

This is more useful if the expressions in the select list return varying results. For example, you could call a function this way:

SELECT random();

5.2 Table Expressions

A table expression computes a table. The table expression contains a FROM clause that is optionally followed by WHERE, GROUP BY, and HAVING clauses. Trivial table expressions simply refer to a table, a so-called base table, but more complex expressions can be used to modify or combine base tables in various ways. The optional WHERE, GROUP BY, and HAVING clauses in the table expression specify a pipeline of successive transformations performed on the table derived in the FROM clause. All these transformations produce a virtual table that provides the rows that are passed to the select list to compute the output rows of the query.

The FROM Clause

The FROM clause derives a table from one or more other tables given in a comma-separated table reference list.

FROM table_reference [, table_reference [, ...]]

A table reference can be a table name (optionally qualified by <catalog>.<schema> or <schema>), or a derived table such as a subquery, a JOIN construct, or complex combinations of these. If more than one table reference is listed in the FROM clause, the tables are cross-joined (that is, the Cartesian product of their rows is formed; see below). The result of the FROM list is an intermediate virtual table that can then be subject to transformations by the WHERE, GROUP BY, and HAVING clauses and is finally the result of the overall table expression.

Joined Tables

A joined table is a table derived from two other (real or derived) tables according to the rules of the particular join type. Inner, outer, and cross-joins are available. The general syntax of a joined table is:

T1 join_type T2 [ join_condition ]

Joins of all types can be chained together, or nested: either or both T1 and T2 can be joined tables. Parentheses can be used around JOIN clauses to control the join order. In the absence of parentheses, JOIN clauses nest left-to-right. The following describes the join types supported:

CROSS JOIN

T1 CROSS JOIN T2

For every possible combination of rows from T1 and T2 (i.e., a Cartesian product), the joined table will contain a row consisting of all columns in T1 followed by all columns in T2. If the tables have N and M rows respectively, the joined table will have N * M rows.

INNER JOIN

For each row R1 of T1, the joined table has a row for each row in T2 that satisfies the join condition with R1.

LEFT OUTER JOIN

First, an inner join is performed. Then, for each row in T1 that does not satisfy the join condition with any row in T2, a joined row is added with null values in columns of T2. Thus, the joined table always has at least one row for each row in T1.

RIGHT OUTER JOIN

First, an inner join is performed. Then, for each row in T2 that does not satisfy the join condition with any row in T1, a joined row is added with null values in columns of T1. This is the converse of a left join: the result table will always have a row for each row in T2.

ON Clause

The ON clause is the most general kind of join condition: it takes a Boolean value expression of the same kind as is used in a WHERE clause. A pair of rows from T1 and T2 match if the ON expression evaluates to true.

USING Clause

The USING clause is a shorthand that allows you to take advantage of the specific situation where both sides of the join use the same name for the joining column(s). It takes a comma-separated list of the shared column names and forms a join condition that includes an equality comparison for each one. For example, joining T1 and T2 with USING (a, b) produces the join condition ON T1.a = T2.a AND T1.b = T2.b.

To put this together, assume we have a table t1, with columns num and name:

num  name
1    a
2    b
3    c

and a table t2, with columns num and value:

num  value
1    xxx
3    yyy
5    zzz

then we get the following results for the various joins:
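The following sketches the results these joins produce for the sample tables above (the exact display format of the shell may differ):

SELECT * FROM t1 INNER JOIN t2 ON t1.num = t2.num;

num  name  num   value
1    a     1     xxx
3    c     3     yyy

SELECT * FROM t1 LEFT OUTER JOIN t2 ON t1.num = t2.num;

num  name  num   value
1    a     1     xxx
2    b     null  null
3    c     3     yyy

SELECT * FROM t1 RIGHT OUTER JOIN t2 ON t1.num = t2.num;

num   name  num   value
1     a     1     xxx
3     c     3     yyy
null  null  5     zzz

SELECT * FROM t1 INNER JOIN t2 USING (num);

num  name  value
1    a     xxx
3    c     yyy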

The join condition specified with ON can also contain conditions that do not relate directly to the join. This can prove useful for some queries but needs to be thought out carefully. For example, compare the two queries sketched below: notice that placing the restriction in the WHERE clause produces a different result from placing it in the ON clause.
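A minimal sketch using the sample tables t1 and t2 from above (the exact display format of the shell may differ):

SELECT * FROM t1 LEFT OUTER JOIN t2 ON t1.num = t2.num AND t2.value = 'xxx';

num  name  num   value
1    a     1     xxx
2    b     null  null
3    c     null  null

SELECT * FROM t1 LEFT OUTER JOIN t2 ON t1.num = t2.num WHERE t2.value = 'xxx';

num  name  num   value
1    a     1     xxx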

This is because a restriction placed in the ON clause is processed before the join, while a restriction placed in the WHERE clause is processed after the join. That does not matter with inner joins, but it matters a lot with outer joins.

Table and Column Aliases

A temporary name can be given to tables and complex table references, to be used for references to the derived table in the rest of the query. This is called a table alias. To create a table alias, write:

FROM table_reference AS alias

or:

FROM table_reference alias

The AS keyword is optional. alias can be any identifier. A typical application of table aliases is to assign short identifiers to long table names to keep the join clauses readable. For example:

SELECT * FROM some_very_long_table_name s JOIN another_fairly_long_name a ON s.id = a.num;

The alias becomes the new name of the table reference as far as the current query is concerned; it is not allowed to refer to the table by the original name elsewhere in the query. Thus, this is not valid:

SELECT * FROM my_table AS m WHERE my_table.a > 5; -- wrong

Table aliases are mainly for notational convenience, but it is necessary to use them when joining a table to itself, e.g.:

SELECT * FROM people AS mother JOIN people AS child ON mother.id = child.mother_id;

Additionally, an alias is required if the table reference is a subquery (see the Subqueries section below). Parentheses are used to resolve ambiguities. In the following example, the first statement assigns the alias b to the second instance of my_table, but the second statement assigns the alias to the result of the join:

SELECT * FROM my_table AS a CROSS JOIN my_table AS b ...
SELECT * FROM (my_table AS a CROSS JOIN my_table) AS b ...

Another form of table aliasing gives temporary names to the columns of the table, as well as the table itself:

FROM table_reference [AS] alias ( column1 [, column2 [, ...]] )

If fewer column aliases are specified than the actual table has columns, the remaining columns are not renamed. This syntax is especially useful for self-joins or subqueries. When an alias is applied to the output of a JOIN clause, the alias hides the original name(s) within the JOIN. For example:

SELECT a.* FROM my_table AS a JOIN your_table AS b ON ...

is valid SQL, but:

SELECT a.* FROM (my_table AS a JOIN your_table AS b ON ...) AS c

is not valid; the table alias a is not visible outside the alias c.

NOTE: The alias name cannot be a reserved word for the underlying data store where the data resides. If a reserved word is chosen, the query will fail, and for some data stores the query will still fail if the alias name is put in double quotes. To resolve the problem, the alias name must be changed. If the alias name must be retained, then add a space character after the alias name and enclose the alias name along with the space character in double quotes. For example, the following query will fail against MemSQL because the alias name INT1 is a reserved word:

select cast(f_col1 as integer) as int1 from t1 where f_col2 > 480 order by int1;

To preserve the name int1, the query would be rewritten as:

select cast(f_col1 as integer) as "int1 " from t1 where f_col2 > 480 order by "int1 ";

Subqueries

Subqueries specifying a derived table must be enclosed in parentheses and must be assigned a table alias name (as described under Table and Column Aliases above). For example:

FROM (SELECT * FROM table1) AS alias_name

This example is equivalent to FROM table1 AS alias_name. More interesting cases, which cannot be reduced to a plain join, arise when the subquery involves grouping or aggregation. For more information, see the sections that follow.

WHERE Clause

The syntax of the WHERE clause is:

WHERE search_condition

where search_condition is any value expression (see Section 3.2) that returns a value of type boolean. After the processing of the FROM clause is done, each row of the derived virtual table is checked against the search condition. If the result of the condition is true, the row is kept in the output table, otherwise (i.e., if the result is false or null) it is discarded. Here are some examples of WHERE clauses:

SELECT ... FROM fdt WHERE c1 > 5
SELECT ... FROM fdt WHERE c1 IN (1, 2, 3)
SELECT ... FROM fdt WHERE c1 IN (SELECT c1 FROM t2)
SELECT ... FROM fdt WHERE c1 IN (SELECT c3 FROM t2 WHERE c2 = fdt.c1 + 10)
SELECT ... FROM fdt WHERE c1 BETWEEN (SELECT c3 FROM t2 WHERE c2 = fdt.c1 + 10) AND 100
SELECT ... FROM fdt WHERE EXISTS (SELECT c1 FROM t2 WHERE c2 > fdt.c1)

fdt is the table derived in the FROM clause. Rows that do not meet the search condition of the WHERE clause are eliminated from fdt. Notice the use of scalar subqueries as value expressions. Just like any other query, the subqueries can employ complex table expressions. Notice also how fdt is referenced in the subqueries. Qualifying c1 as fdt.c1 is only necessary if c1 is also the name of a column in the derived input table of the subquery. But qualifying the column name adds clarity even when it is not needed. This example shows how the column naming scope of an outer query extends into its inner queries.

GROUP BY and HAVING Clause

After passing the WHERE filter, the derived input table might be subject to grouping, using the GROUP BY clause, and elimination of group rows using the HAVING clause.

SELECT select_list FROM ... [WHERE ...] GROUP BY exprlist

The GROUP BY Clause is used to group together those rows in a table that have the same values in all the columns listed. The order in which the columns are listed does not matter. The effect is to combine each set of rows having common values into one group row that represents all rows in the group. This is done to eliminate redundancy in the output and/or compute aggregates that apply to these groups. If a table has been grouped using GROUP BY, but only certain groups are of interest, the HAVING clause can be used, much like a WHERE clause, to eliminate groups from the result. The syntax is:

SELECT select_list FROM ... [WHERE ...] GROUP BY ... HAVING boolean_expression

Expressions in the HAVING clause can refer both to grouped expressions and to ungrouped expressions (which necessarily involve an aggregate function). Examples:

SELECT x, sum(y) FROM test1 GROUP BY x HAVING sum(y) > 3;
SELECT x, sum(y) FROM test1 GROUP BY x HAVING x < 'c';

SELECT product_id, p.name, (sum(s.units) * (p.price - p.cost)) AS profit
FROM products p LEFT JOIN sales s USING (product_id)
WHERE s.date > '2016-06-01 00:00:00' AND s.date < '2016-07-01 00:00:00'
GROUP BY product_id, p.name, p.price, p.cost
HAVING sum(p.price * s.units) > 5000;

In the example above, the WHERE clause is selecting rows by a column that is not grouped (the expression is only true for sales during the month of June), while the HAVING clause restricts the output to groups with total gross sales over 5000. Note that the aggregate expressions do not necessarily need to be the same in all parts of the query.

5.3 SELECT Lists

As shown in the previous section, the table expression in the SELECT command constructs an intermediate virtual table by possibly combining tables, views, eliminating rows, grouping, etc. This table is finally passed on to processing by the select list. The select list determines which columns of the intermediate table are actually output.

SELECT List Items

The simplest kind of select list is *, which emits all columns that the table expression produces. Otherwise, a select list is a comma-separated list of value expressions (as defined in Section 3.2). For instance, it could be a list of column names:

SELECT a, b, c FROM ...

The column names a, b, and c are either the actual names of the columns of tables referenced in the FROM clause, or the aliases given to them as explained in the Table and Column Aliases section above. The name space available in the select list is the same as in the WHERE clause, unless grouping is used, in which case it is the same as in the HAVING clause. If more than one table has a column of the same name, the table name must also be given, as in:

SELECT tbl1.a, tbl2.a, tbl1.b FROM ...

When working with multiple tables, it can also be useful to ask for all the columns of a particular table:

SELECT tbl1.*, tbl2.a FROM ...

If an arbitrary value expression is used in the select list, it conceptually adds a new virtual column to the returned table. The value expression is evaluated once for each result row, with the row's values substituted for any column references. But the expressions in the select list do not have to reference any columns in the table expression of the FROM clause; they can be constant arithmetic expressions, for instance.

Column Labels

The entries in the select list can be assigned names for subsequent processing, such as for use in an ORDER BY clause or for display by the client application. For example:

SELECT a AS value, b + c AS sum FROM ...

If no output column name is specified using AS, the system assigns a default column name. For simple column references, this is the name of the referenced column. For complex expressions, the system will generate a generic name.

DISTINCT

After the select list has been processed, the result table can optionally be subject to the elimination of duplicate rows. The DISTINCT keyword is written directly after SELECT to specify this:

SELECT DISTINCT select_list ...

(Instead of DISTINCT the keyword ALL can be used to specify the default behavior of retaining all rows.)

Obviously, two rows are considered distinct if they differ in at least one column value. Null values are considered equal in this comparison.

5.4 Combining Queries

The results of two queries can be combined using the UNION set operation. The syntax is

query1 UNION [ALL] query2

query1 and query2 are queries that can use any of the features discussed up to this point. Set operations can also be nested and chained, for example

query1 UNION query2 UNION query3

which is executed as:

(query1 UNION query2) UNION query3

UNION effectively appends the result of query2 to the result of query1 (although there is no guarantee that this is the order in which the rows are actually returned). Furthermore, it eliminates duplicate rows from its result, in the same way as DISTINCT, unless UNION ALL is used.
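For example, assuming tables table1 and table2 that both have columns a and b of compatible types (illustrative names), the following returns the distinct combined rows; the UNION ALL form would retain duplicates:

SELECT a, b FROM table1
UNION
SELECT a, b FROM table2;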

In order to calculate the union of two queries, the two queries must be "union compatible", which means that they return the same number of columns and the corresponding columns have compatible data types.

5.5 ORDER BY

After a query has produced an output table (after the select list has been processed) it can optionally be sorted. If sorting is not chosen, the rows will be returned in an unspecified order. The actual order in that case will depend on the scan and join plan types, but it must not be relied on. A particular output ordering can only be guaranteed if the sort step is explicitly chosen.

The ORDER BY clause specifies the sort order:

SELECT select_list FROM table_expression ORDER BY orderbylist [ASC | DESC]

The orderbylist can be any expression that would be valid in the query's select list. An example is:

SELECT a, b FROM table1 ORDER BY a + b, c;

When more than one expression is specified, the later values are used to sort rows that are equal according to the earlier values. Each expression can be followed by an optional ASC or DESC keyword to set the sort direction to ascending or descending. ASC order is the default. Ascending order puts smaller values first, where "smaller" is defined in terms of the < operator. Similarly, descending order is determined with the > operator. Note that the ordering options are considered independently for each sort column. For example ORDER BY x, y DESC means ORDER BY x ASC, y DESC, which is not the same as ORDER BY x DESC, y DESC.

A sort_expression can also be the column label or number of an output column, as in:

SELECT a + b AS sum, c FROM table1 ORDER BY sum;
SELECT a, max(b) FROM table1 GROUP BY a ORDER BY 1;

both of which sort by the first output column. ORDER BY can be applied to the result of a UNION, INTERSECT, or EXCEPT combination, but in this case it is only permitted to sort by output column names or numbers, not by expressions.

5.6 LIMIT and OFFSET

LIMIT and OFFSET allow you to retrieve just a portion of the rows that are generated by the rest of the query:

SELECT select_list FROM table_expression

[ ORDER BY ... ]
[ LIMIT { number | ALL } ]
[ OFFSET number ]

If a limit count is given, no more than that many rows will be returned (but possibly fewer, if the query itself yields fewer rows). LIMIT ALL is the same as omitting the LIMIT clause. OFFSET says to skip that many rows before beginning to return rows. OFFSET 0 is the same as omitting the OFFSET clause. If both OFFSET and LIMIT appear, then OFFSET rows are skipped before starting to count the LIMIT rows that are returned.

When using LIMIT, it is important to use an ORDER BY clause that constrains the result rows into a unique order. Otherwise you will get an unpredictable subset of the query's rows. You might be asking for the tenth through twentieth rows, but tenth through twentieth in what ordering? The ordering is unknown, unless you specified ORDER BY.

The query optimizer takes LIMIT into account when generating query plans, so you are very likely to get different plans (yielding different row orders) depending on what you give for LIMIT and OFFSET. Thus, using different LIMIT/OFFSET values to select different subsets of a query result will give inconsistent results unless you enforce a predictable result ordering with ORDER BY. The rows skipped by an OFFSET clause still have to be computed inside the server; therefore a large OFFSET might be inefficient.

6. Functions and Operators

6.1 Logical Operators

The usual logical operators are available: AND, OR, NOT.

SQL uses a three-valued logic system with true, false, and null, which represents "unknown". Observe the following truth tables:

a | b | a AND b | a OR b
TRUE | TRUE | TRUE | TRUE
TRUE | FALSE | FALSE | TRUE
TRUE | NULL | NULL | TRUE
FALSE | FALSE | FALSE | FALSE
FALSE | NULL | FALSE | NULL
NULL | NULL | NULL | NULL

a | NOT a
TRUE | FALSE
FALSE | TRUE
NULL | NULL

The operators AND and OR are commutative, that is, you can switch the left and right operand without affecting the result.

6.2 Comparison Operators and BETWEEN

The usual comparison operators are available:

Operator | Description
< | less than
> | greater than
<= | less than or equal to
>= | greater than or equal to
= | equal
<> or != | not equal

Comparison operators are available for all relevant data types. All comparison operators are binary operators that return values of type Boolean.

In addition to the comparison operators, the special BETWEEN construct is available:

a BETWEEN x AND y

is equivalent to

a >= x AND a <= y

Notice that BETWEEN treats the endpoint values as included in the range. NOT BETWEEN does the opposite comparison:

a NOT BETWEEN x AND y

is equivalent to

a < x OR a > y

To check whether a value is or is not null, use the constructs:

expression IS NULL
expression IS NOT NULL

Do not write expression = NULL because NULL is not "equal to" NULL. (The null value represents an unknown value, and it is not known whether two unknown values are equal.) This behavior conforms to the SQL standard.

Boolean values can also be tested using the constructs:

expression IS TRUE
expression IS NOT TRUE
expression IS FALSE
expression IS NOT FALSE

6.3 Mathematical Operators and Functions

Mathematical Operators

Operator | Description | Example | Result
+ | addition | 2 + 3 | 5
- | subtraction | 2 - 3 | -1
* | multiplication | 2 * 3 | 6
/ | division (integer division truncates the result) | 4 / 2 | 2
% | modulo (remainder) | 5 % 4 | 1

Mathematical Functions

Function | Return Type | Description | Example | Result
abs(x) | (same as input) | absolute value | abs(-17.4) | 17.4
ceil(numeric) | integer | smallest integer not less than argument | ceil(-42.8) | -42
ceiling(numeric) | integer | smallest integer not less than argument (alias for ceil) | ceiling(-95.3) | -95
floor(numeric) | integer | largest integer not greater than argument | floor(-42.8) | -43
power(numeric, numeric) | float | raise first argument to the power of the second argument | power(9, 2) | 81.0
pow(numeric, numeric) | float | raise first argument to the power of the second argument | pow(9, 2) | 81.0
round(numeric) | integer | round to nearest integer | round(123.99) | 123
round(numeric, int) | float | round to int number of decimal places | round(42.4382, 2) | 42.44
stddev(expression) | float | historical alias for stddev_samp
stddev_pop(expression) | float | population standard deviation of the input values

stddev_samp(expression) | float | sample standard deviation of the input values
variance(expression) | float | historical alias for var_samp
var_pop(expression) | float | population variance of the input values (square of the population standard deviation)
var_samp(expression) | float | sample variance of the input values (square of the sample standard deviation)

6.4 String Functions and Operators

Function | Return Type | Description | Example | Result
string || string | text | String concatenation | 'Post' || 'gresql' | PostgreSQL
string || non-string or non-string || string | text | String concatenation with one non-string input | 'Value: ' || 42 | Value: 42
char_length(string) or character_length(string) | int | Number of characters in string | char_length('jose') | 4
lower(string) | text | Convert string to lower case | lower('TOM') | tom
position(substring in string) | int | Location of specified substring | position('om' in 'Thomas') | 3
substring(string from int [for int]) | text | Extract substring starting at the from position, for the length specified by the for (defaults to rest of string) | substring('Thomas' from 2 for 3); substring('Thomas' from 2) | hom; homas
substring(string from negative int [for int]) | text | Extract substring starting at the from position counting backwards from the right of the string, for the length specified by the for (defaults to rest of string) | substring('Thomas' from -3 for 3) | mas

trim([leading | trailing | both] [characters] from string) | text | Remove the longest string containing only the characters (a space by default) from the start/end/both ends of the string | trim(both 'x' from 'xTomxx') | Tom
trim([leading | trailing | both] [from] string [, characters]) | text | Non-standard version of trim() | trim(both from 'xTomxx', 'x') | Tom
ltrim(string [, characters]) | text | Remove the longest string containing only the characters (a space by default) from the start of the string | ltrim('  Tom'); ltrim('abTom', 'ab') | Tom; Tom
rtrim(string [, characters]) | text | Remove the longest string containing only the characters (a space by default) from the end of the string | rtrim('Tom  '); rtrim('Tomab', 'ab') | Tom; Tom
upper(string) | text | Convert string to upper case | upper('tom') | TOM
decode(string text, format text) | bytea | Decode binary data from textual representation in string. Options for format are same as in encode. | decode('MTIzAAE=', 'base64') | \x3132330001
left(str text, n int) | text | Return first n characters in the string. When n is negative an exception will be returned.
right(str text, n int) | text | Return last n characters in the string. When n is negative an exception will be returned.

6.5 Pattern Matching

LIKE

{string-expression} LIKE '{pattern}' [ESCAPE escape-character]

The LIKE expression returns true if the string-expression matches the supplied pattern. (As expected, the NOT LIKE expression returns false if LIKE returns true, and vice versa.) If pattern does not contain percent signs or underscores, then the pattern only represents the string itself; in that case LIKE acts like the equals operator. An underscore (_) in pattern stands for (matches) any single character; a percent sign (%) matches any sequence of zero or more characters. Some examples:

'abc' LIKE 'abc'    true
'abc' LIKE 'a%'     true
'abc' LIKE '_b_'    true
'abc' LIKE 'c'      false

LIKE pattern matching always covers the entire string. Therefore, if it's desired to match a sequence anywhere within a string, the pattern must start and end with a percent sign. To match a literal underscore or percent sign the user must specify an escape character to be used as part of the pattern string by adding the ESCAPE clause after the pattern string. The respective character in the pattern must then be preceded by the escape character. Some examples:

'a_b' LIKE 'a\_%' ESCAPE '\'    true
'abc' LIKE 'a\_%' ESCAPE '\'    false

6.6 Date/Time Functions

EXTRACT(from timestamp)

EXTRACT( selection-keyword FROM timestamp-expression )

The EXTRACT() function returns the value of the selected portion of a timestamp. The table below lists the supported keywords, the datatype of the value returned by the function, and a description of its contents.

Keyword | Datatype | Description
YEAR | INTEGER | The year as a numeric value.

Keyword | Datatype | Description
QUARTER | INTEGER | The quarter of the year as a single numeric value between 1 and 4.
MONTH | INTEGER | The month of the year as a numeric value between 1 and 12.
DAY | INTEGER | The day of the month as a numeric value between 1 and 31.
WEEK | INTEGER | The week of the year as a numeric value between 1 and 52.
HOUR | INTEGER | The hour of the day as a numeric value between 0 and 23.
MINUTE | INTEGER | The minute of the hour as a numeric value between 0 and 59.
SECOND | DECIMAL | The whole and fractional part of the number of seconds within the minute as a floating point value between 0 and 60.

CURRENT_TIMESTAMP

CURRENT_TIMESTAMP

Returns the current date and time as a timestamp.

NOW()

NOW()

Returns the current date and time as a timestamp. Equivalent to CURRENT_TIMESTAMP.

6.6.4 Interval Arithmetic

Interval Types

RapidsDB provides support for INTERVAL arithmetic as defined by the SQL-99 standard. There are two types of intervals:

1) YEAR-MONTH INTERVAL
   e.g. INTERVAL '1' YEAR
   e.g. INTERVAL '2' MONTH
   e.g. INTERVAL '1-2' YEAR TO MONTH

2) DAY-TIME INTERVAL
   e.g. INTERVAL '5' DAY
   e.g. INTERVAL '5 10:10' DAY TO MINUTE
   e.g. INTERVAL '1 2:10:10.234' DAY TO SECOND
   and so on...
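Intervals can be combined with timestamps in expressions and predicates. The following is a minimal sketch, assuming the orders table and its o_orderdate TIMESTAMP column that appear in the query execution examples later in this guide; it returns the orders placed within the last 30 days:

SELECT o_orderkey, o_orderdate
FROM orders
WHERE o_orderdate > CURRENT_TIMESTAMP - INTERVAL '30' DAY;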

Precision: The user can specify a leading precision for any of the intervals. The default precision for all of DAY, HOUR, MINUTE, SECOND, YEAR, MONTH is 2. The maximum precision allowed is 9. The default fractional second precision is 6, and the maximum is 9. You can specify precision for the leading field and also for the SECOND field; the remaining fields will follow the default precision. For example, the following are valid intervals:

INTERVAL '1-2' YEAR TO MONTH
INTERVAL '13-2' YEAR TO MONTH
INTERVAL '199-2' YEAR(3) TO MONTH
INTERVAL '199' MONTH(3)   (note that here the leading field is MONTH, and so a precision can be specified for MONTH)
INTERVAL '1 10:10:10.234' DAY TO SECOND
INTERVAL '123 10:10:10.234' DAY(3) TO SECOND
INTERVAL '123 10:10:10.12345678' DAY(3) TO SECOND(8)
INTERVAL '123 10:10:10.12345678' DAY(3) TO SECOND   (the fractional second will round off to the default 6 digits of precision, and you will get back 123 10:10:10.123457)

Range: Can be negative or positive.

YEAR-MONTH interval:

Can be negative or positive:
year - constrained by precision. Hence with a precision of 9 the maximum value can be 999999999.
month - 0 to 11 (but if leading then constrained by precision).

The following example is valid: INTERVAL '10-10' YEAR TO MONTH, but the following is invalid: INTERVAL '10-13' YEAR TO MONTH.

The following is also valid: INTERVAL '13' MONTH. This is valid because MONTH is the leading field, and so it is constrained by precision, and the leading default precision is 2. So you can have a max value of 99 in month. But if you specify a precision of more than 2 it can be higher. For example, you can have: INTERVAL '999' MONTH(3)

DAY-TIME interval:

Can be negative or positive:

day - constrained by precision. Hence with a precision of 9, the maximum value can be 999999999.
hour - 0 to 23
minute - 0 to 59
second - 0 to 59.999999999

Note that if hour, minute or second is leading then a precision other than the default can be specified for it, e.g. INTERVAL '999' HOUR(3). Also note that a fractional second precision can be given, e.g. INTERVAL '10:10:10.234' HOUR TO SECOND(3). We can also have: INTERVAL '10.89' SECOND(2,2). Also note that with the fractional second, if the number does not fit the precision, it will get rounded, e.g. INTERVAL '10.12345' SECOND(2,4) will become '10.1235'.

Support for interval comparisons:

We can compare DAY-TIME intervals with DAY-TIME intervals. We can compare YEAR-MONTH intervals with YEAR-MONTH intervals.

Support for Interval Arithmetic:

Operand | Operator | Operand | Result Type
Timestamp | - | Timestamp | Interval
Timestamp | + | Interval | Timestamp
Timestamp | - | Interval | Timestamp
Interval | + | Timestamp | Timestamp
Interval | + | Interval | Interval
Interval | - | Interval | Interval
Interval | * | Numeric | Interval
Numeric | * | Interval | Interval
Interval | / | Numeric | Interval

Note: When you do arithmetic on intervals, the resulting interval has a precision of the maximum allowed. For example:

select INTERVAL '12 10:10:10.123' DAY(3) TO SECOND + INTERVAL '10 10:12:10.345' DAY TO SECOND from x;

22 20:22:20.468000000

1 row(s) returned

EXTRACT(from interval)

EXTRACT( selection-keyword FROM interval-expression )

The EXTRACT() function returns the value of the selected portion of an interval. The table below lists the supported keywords, the datatype of the value returned by the function, and a description of its contents.

Keyword | From Interval | Datatype | Description
YEAR | Year-month | INTEGER | The year as a numeric value.
QUARTER | Year-month | INTEGER | The quarter as a numeric value.
MONTH | Year-month | INTEGER | The month as a numeric value.
DAY | Day-time | INTEGER | The day as a numeric value.
HOUR | Day-time | INTEGER | The hour as a numeric value.
MINUTE | Day-time | INTEGER | The minute as a numeric value.
SECOND | Day-time | INTEGER | The second as a numeric value.

String to Interval Functions

Function | Returned Interval Type | Description
TO_YMINTERVAL(string) | YEAR-MONTH | Converts a character string to a YEAR-TO-MONTH interval. The format of the string is any valid YEAR-MONTH interval (see the Interval Types section above).
TO_DTINTERVAL(string) | DAY-TIME | Converts a character string to a DAY-TIME interval. The format for the string is any valid DAY-TIME interval (see the Interval Types section above).

BETWEEN Operator:

BETWEEN interval1 AND interval2

The BETWEEN operator can be used to test whether a value lies between two day-time or two year-month intervals. For example:

SELECT ... WHERE TS_INTERVAL BETWEEN INTERVAL '100 10:00:00.000' DAY TO SECOND AND INTERVAL '299 10:00:00.000' DAY TO SECOND

6.7 CONDITIONAL EXPRESSIONS

6.7.1 CASE

The CASE expression is a generic conditional expression, similar to if/else statements in other programming languages:

CASE WHEN condition THEN result
     [WHEN ...]
     [ELSE result]
END

CASE clauses can be used wherever an expression is valid. Each condition is an expression that returns a boolean result. If the condition's result is true, the value of the CASE expression is the result that follows the condition, and the remainder of the CASE expression is not processed. If the condition's result is not true, any subsequent WHEN clauses are examined in the same manner. If no WHEN condition yields true, the value of the CASE expression is the result of the ELSE clause. If the ELSE clause is omitted and no condition is true, the result is null.
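For example, the following sketch (assuming a table t with an integer column status_code; both names are illustrative) maps numeric codes to labels and falls back to 'other' when no WHEN condition matches:

SELECT status_code,
       CASE WHEN status_code = 1 THEN 'new'
            WHEN status_code = 2 THEN 'shipped'
            ELSE 'other'
       END AS status_label
FROM t;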

6.7.2 COALESCE

COALESCE(value [, ...])

The COALESCE function returns the first of its arguments that is not null. Null is returned only if all arguments are null. It is often used to substitute a default value for null values when data is retrieved for display, for example:

SELECT COALESCE(description, short_description, '(none)') ...

This returns description if it is not null, otherwise short_description if it is not null, otherwise the text '(none)'.

6.7.3 IF

IF( boolean_expression, true_result_expression, false_result_expression )

IF evaluates the boolean_expression, and then evaluates one of the other two expressions to produce a result. If the boolean_expression is true, then the true_result_expression is evaluated and returned as the result; otherwise the false_result_expression is evaluated and returned as the result. true_result_expression and false_result_expression may be of any type but the two must match or be implicitly convertible to a common type.

Notes: If boolean_expression evaluates to NULL, the result is the same as if it evaluated to false.

Example:

SELECT IF(1<2, 2, 3)

This returns the value 2.

6.7.4 IFNULL

IFNULL(value1, value2)

The IFNULL function returns value2 if value1 is null; otherwise it returns value1, for example:

SELECT IFNULL(description, '(none)')

This returns the string '(none)' if the value for the description column is null, otherwise it returns the value for the column.

6.7.5 NULLIF

NULLIF(value1, value2)

The NULLIF function returns a null value if value1 equals value2; otherwise it returns value1, for example:

SELECT NULLIF(description, '(none)')

This returns a null value if the value for the description column equals '(none)', otherwise it returns the value for the description column.

6.8 AGGREGATE FUNCTIONS

Aggregate functions compute a single result from a set of input values.

Function | Argument Type(s) | Return Type | Description
avg(expression) | integer, decimal or float | double precision for a floating-point argument, otherwise same as the argument data type | the average (arithmetic mean) of all input values
count(*) | | integer | number of input rows
count(expression) | any | integer | number of input rows for which the value of expression is not null
max(expression) | any numeric, string, or date/time types | same as argument type | maximum value of expression across all input values
min(expression) | any numeric, string, or date/time types | same as argument type | minimum value of expression across all input values
sum(expression) | integer, decimal or float types | same as the argument data type | sum of expression across all input values

It should be noted that except for count, these functions return a null value when no rows are selected. In particular, sum of no rows returns null, not zero as one might expect. The COALESCE function can be used to substitute zero for null when necessary.
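For example, a minimal sketch of this pattern, using the lineitem table from the join examples later in this guide (the order key value is illustrative); it returns 0 rather than null when no rows match:

SELECT COALESCE(SUM(l_quantity), 0) AS total_quantity
FROM lineitem
WHERE l_orderkey = 12345;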

The table below shows the data types returned for the aggregate functions based on the Connector type:

Connector Type | Function | Arguments | Result
RapidsSE | AVG, MAX, MIN, SUM | Integer | Integer
RapidsSE | AVG, MAX, MIN, SUM | Float | Float
RapidsSE | AVG, MAX, MIN, SUM | Decimal | Decimal, with canonical precision and scale of arguments
MemSQL | AVG, MAX, MIN, SUM | Integer | Integer
MemSQL | AVG, MAX, MIN, SUM | Float | Float
MemSQL | AVG, MAX, MIN, SUM | Decimal | Decimal, with canonical precision and scale of arguments
VoltDB | AVG, MAX, MIN, SUM | Integer | Integer
VoltDB | AVG, MAX, MIN, SUM | Float | Float
VoltDB | AVG, MAX, MIN, SUM | Decimal | Fixed precision of 38 and scale of 12
Stream | AVG, MAX, MIN, SUM | Integer | Integer
Stream | AVG, MAX, MIN, SUM | Float | Float
Stream | AVG, MAX, MIN, SUM | Decimal | Decimal, with canonical precision and scale of arguments
Multiple data sources (1) | AVG, MAX, MIN, SUM | Integer | Integer
Multiple data sources (1) | AVG, MAX, MIN, SUM | Float | Float
Multiple data sources (1) | AVG, MAX, MIN, SUM | Decimal | Decimal, with canonical precision and scale of arguments

58 Note: 1. Multiple data sources is when the query includes tables from different Connectors, in which case the query will be executed within the RapidsDB Execution Engine. 6.9 SUB-QUERY EXPRESSIONS IN expression IN (subquery) The right-hand side is a parenthesized subquery, which must return exactly one column. The left-hand expression is evaluated and compared to each row of the subquery result. The result of IN is "true" if any equal subquery row is found. The result is "false" if no equal row is found (including the case where the subquery returns no rows). Note that if the left-hand expression yields null, or if there are no equal right-hand values and at least one right-hand row yields null, the result of the IN construct will be null, not false NOT IN expression NOT IN (subquery) The right-hand side is a parenthesized subquery, which must return exactly one column. The left-hand expression is evaluated and compared to each row of the subquery result. The result of NOT IN is "true" if only unequal subquery rows are found (including the case where the subquery returns no rows). The result is "false" if any equal row is found. Note that if the left-hand expression yields null, or if there are no equal right-hand values and at least one right-hand row yields null, the result of the NOT IN construct will be null, not true EXISTS EXISTS (subquery) The argument of EXISTS is an arbitrary SELECT statement, or subquery. The subquery is evaluated to determine whether it returns any rows. If it returns at least one row, the result of EXISTS is "true"; if the subquery returns no rows, the result of EXISTS is "false". 7. QUERY EXECUTION 7.1 RapidsDB SQL Statement Execution RapidsDB will parse a SQL statement and build a query execution plan to execute the SQL statement. When optimizing the execution plan the RapidsDB Optimizer attempts to push down execution of as much of the query logic as possible to the underlying Data Store using a minimum number of operations. Those parts of the query logic that cannot be pushed down will be executed by the RapidsDB Execution Engine. For example, when executing a JOIN that includes tables from two Page 57 Borrui Data Technology Co. Ltd 2016

59 different Connectors, the join will take place in the RapidsDB Execution Engine. EXPLAIN (see 2.1.5) can be used to see which parts of the query will be executed where and to inspect the SQL statements that will be sent to the underlying Data Store. For those parts of a query that are executed by the RapidsDB Execution Engine, there are two types of query plans which can be generated: 1. Partitioned Query Plans (see 7.2) 2. Non-Partitioned Query Plans (see 7.3) NOTE: It is important to understand that Partitioned and Non-Partitioned Query Plans only apply to those parts of the query plan that cannot be pushed down to the underlying data stores, and where the RapidsDB Execution Engine will be executing that part of the query plan. This typically applies to complex joins and sub-queries which the underlying data stores cannot handle. For queries that can be pushed down to the underlying data store, it is the responsibility of the underlying data store to parallelize the query execution where possible. 7.2 Partitioned Query Plans For underlying data stores, such as VoltDB, that allow single queries to be split up and executed in parallel against each of the partitions of the tables being queried, RapidsDB will generate a Partitioned Query Plan, where portions of the query plan will be executed in the RapidsDB Execution Engine in parallel against each partition of a table. The diagram below illustrates this: Example: select l_orderkey, l_partkey, p_mfgr from part join lineitem on p_partkey = l_partkey and p_mfgr = 'LG'; Page 58 Borrui Data Technology Co. Ltd 2016

60 The diagram above shows two DQE nodes each running a VoltDB Connector, with RapidsDB executing the join between the PART and LINEITEM tables. This join will use a Bloom Join (see 7.4.1) where each DQE node will retrieve the qualifying rows from the partitions of the PART table (select p_partkey, pmfgr from part where p_mfgr= LG ) on that node to build a Bloom Filter, and then each node will read the rows from the LINEITEM table (select l_partkey from lineitem) on that node and apply the Bloom Filter to get the joined rows. The joined rows will then be returned from each DQE back to the DQC node where the results will be merged to produce the final result. 7.3 Non-Partitioned Query Plans For other data stores where a single query cannot be split up and executed in parallel, RapidsDB will generate a Non-Partitioned Query plan where one query will be sent to the underlying data store, and it will be responsibility of the underlying data store to parallelize the execution of the query if possible. MemSQL is an example of a Data Store where the query cannot be split up by RapidsDB and executed in parallel, but which can parallelize the execution of a SQL statement. The diagram below illustrates this: Example: select l_orderkey, l_partkey, p_mfgr from part join lineitem on p_partkey = l_partkey and p_mfgr = 'LG'; Page 59 Borrui Data Technology Co. Ltd 2016

61 The diagram above shows the entire query getting pushed down to MemSQL (via the memsql Aggregator) and then MemSQL parallelizing the execution across all of the MemSQL Leaf nodes in the MemSQL cluster. 7.4 Combination of Partitioned and Non-Partitioned Plans When joining tables across Connectors where one Connector uses Partitioned plans and the other Connector uses Non-Partitioned plans (eg. VoltDB and MemSQL), the join will be executed by RapidsDB, and RapidsDB will push down as much of the processing as possible before having to execute the join. Example: select l_orderkey, l_partkey, p_mfgr from part join lineitem on p_partkey = l_partkey and p_mfgr = 'LG'; This is the same query as the examples above, but in this example the PART table is managed by VoltDB and the LINEITEM table is managed by MemSQL. Page 60 Borrui Data Technology Co. Ltd 2016

62 The diagram above shows that the data from the PART table will be retrieved in parallel (using a Partitioned plan) using a VoltDB Connector on the RapidsDB DQE nodes, and the data from the LINEITEM table will be retrieved (using a Non-Partitioned plan) using a MemSQL Connector, with MemSQL accessing the data in parallel from the MemSQL Leaf nodes and then the join will take place on the RapidsDB DQC node using a Non-Partitioned plan. 7.5 RapidsDB Join Algorithms The following describes the join algorithms supported by the RapidsDB Execution Engine. These join algorithms will be used when the join processing cannot be pushed down to the underlying Data Stores Bloom Join The Bloom Join is the default join algorithm for Partitioned Query Plans (see 7.2). The Bloom Join offers good all-round performance compared to a Hash Join by minimizing network overhead at a cost of a minor increase in CPU usage on each DQE node. The Bloom Join achieves this by utilizing a Bloom Filter on each remote node to determine if a row on that remote node is likely to satisfy the join criteria Page 61 Borrui Data Technology Co. Ltd 2016

63 before that row gets serialized and sent across the network. This greatly minimizes the number of rows that are transmitted across the network only to fail the join criteria and then be discarded. A Bloom Filter is a probabilistic filter that uses an array of bits to define (in this case) a pattern of valid join values. A row on a remote DQE node gets the value of its join column hashed to produce a value which is then used as an index into the Bloom Filter s bit array. If the corresponding bit in the bit array is set (has a value of 1) then this indicates that the row may possibly satisfy the join criteria. However, if the corresponding bit in the bit array is cleared (has a value of 0) then the row definitely does not satisfy the join criteria. In this way, a Bloom Filter can represent a large set of join values in a very memory efficient way because it only needs to represent any given join value with a minimum of 1 bit. However, if the size of the bit array is too small then when a row that does not satisfy the join criteria is hashed it may produce a value that collides with a bit in the bit array that is already set. As a result the row gets transmitted across the network to the Bloom Join operator which would then double check its actual join value, recognize that it does not satisfy the join criteria, and then discard the row. This hash collision in the Bloom Filter is known as a false positive. Excessive false positives end up consuming network, memory and CPU resources for no net benefit. False positives can be minimized by using a larger bit array in the Bloom Filter and/or hashing the join value of each row multiple times to produce a set of index values into the bit array. i.e., more memory or CPU can be used to lower the incidence of false positives. The following describes how a Bloom Join works: 1. The Bloom Join operator issues a query to the left side table on the local DQE node and gets back a set of result rows. 2. The Bloom Join operator puts the rows from the left side table into a hash map, keyed off the value of the join column in each row. 3. The Bloom Join operator creates a Bloom Filter based upon the number of unique join values from the left side result set. If there are very few unique join values from the left side result set then a minimum Bloom Filter size is used (currently 10,000 unique values). 4. The Bloom Join operator then adds each of the unique join values from the hash map of the left side result set to the Bloom Filter. This has the effect of setting some of the bits in the bit array. 5. The Bloom Join operator then adds the Bloom Filter operator to the right side plan and tries to push it down the right side plan stack as far as possible. 6. The Bloom Join operator then executes the right side plan, which typically gets sent to each DQE node in the cluster before being executed. 7. Each DQE node in the cluster executes the right side plan, which selects rows from the right side table and filters the value of their join column through the Bloom Filter. Rows with join values that do not validate against the bit pattern set in the Bloom Filter get discarded. Rows with join values that do validate against the bit pattern of the Bloom Filter may or may not have a join value that matches a left side row. In any case these rows must be returned to the DQE node where the Bloom Join operator is executing. 8. The Bloom Join operator receives a set of rows from each DQE node whose join values are validated against the bit pattern in the Bloom Filters. 

64 9. For each row received from the right side plan the Bloom Join operator extracts the value of its join column and tries to match it to a row from the left side result set using the local hash map it previously built. If it finds matching rows then the left and right side rows are joined together and returned. If there are no matching rows then the right side row is discarded. The Bloom Join is an efficient join algorithm because its use of a Bloom Filter minimizes use of a bottleneck resource (the network) at the expense of using more lower cost resources (CPU and memory). The filtering of rows also occurs in parallel on each DQE node, so large result sets can still be filtered quickly. The implementation of the Bloom Filter in the Bloom Join aims for a rate of 1% false positive rows. This is a reasonable trade-off of memory/cpu consumption to network overhead of returning additional rows. This goal of a 1% false positive rate is not guaranteed though, as the size of the Bloom Filter is set based upon the number of unique join values from the left side table and it does not anticipate the number of unique join values that may have to be filtered from the right side table. To do so would mean that DQS would need to keep some statistics about the cardinality of unique values for joinable columns in each table, or it would need to read the right side table twice once to determine how many unique join values there are and then again to filter them through the Bloom Filter. The latter would produce an unacceptable tradeoff to statement latency, extra memory usage and CPU consumption. As a result, one scenario where the Bloom Filter runs less efficiently than it otherwise would is where the number of unique join values from the left side table is very small (say, less than 100), resulting in a Bloom Filter size being set to a minimum default value of 10,000. Then if the number of unique join values on the right side table is very large (say, greater than 2,000), the size of the Bloom Filter is too small to operate close to the goal of a 1% false positive rate. As a result, more rows will validate against the bit pattern in the Bloom Filter and will get returned to the Bloom Join. i.e., the actual false positive rate will greatly increase (in this example a 10% or larger false positive rate would not be unexpected). The worst case scenario is where the Bloom Filter does not have enough bits in its bit array to represent all the possible unique join values from the right side table. In this case, all rows from the right side table will be validated by the Bloom Filter and then returned to the Bloom Join operator across the network. This scenario is then functionally equivalent to a Hash Join and it has a very similar performance profile to the Hash Join. Because of this, the Bloom Join operator is an efficient, all-round join algorithm and therefore it was chosen to be the default join type for Partitioned Query Plans (see 7.2). The diagram below shows an example query using the Bloom Join. The query being executed is: SELECT o_clerk, sum(l_quantity) FROM orders /*+ bloom */ join lineitem on o_orderkey = l_orderkey Page 63 Borrui Data Technology Co. Ltd 2016

65 WHERE o_orderkey < GROUP BY o_clerk; In this example RapidsDB is executing the query on a 3 node cluster, where a Bloom Join operator on each DQE node is joining the left side table from that node to the right side table from all other nodes. The diagram below shows how a Bloom Filter operator that gets created from the left side row results gets pushed down the right side plan and executed on each node. It also shows how this Bloom Filter eliminates the vast majority of rows that will not satisfy the join condition, and it does so on each local node before returning valid rows across the network. Page 64 Borrui Data Technology Co. Ltd 2016

Figure: Bloom Join Across 3 DQE Nodes (on each DQE node a BloomFilterOp filters the local right-side lineitem rows against the Bloom Filter built from the qualifying orders rows, a BloomJoinOp performs the join, and the per-node results are then merged and aggregated to produce the final result).

Hash Join

The Hash Join is the default join algorithm for Non-Partitioned query plans (see 7.3). With a Hash Join the following is executed on each DQE node:

- The rows that qualify from the WHERE predicate on the right-side of the join are retrieved from all partitions of the Data Store on all of the nodes, and a hash map is created from the retrieved data using the key column for the right-side of the join.

- Data that qualifies from the WHERE predicate on the left-side of the join is extracted from all partitions of the Data Store on the current node.
- The join keys from the left-side of the join are then matched against the keys from the right-side of the join using the hash map.

Example:

SELECT l_orderkey, SUM(l_extendedprice) AS revenue, o_orderdate, o_shippriority
FROM lineitem /*+ hash */ join orders ON l_orderkey = o_orderkey
WHERE o_orderdate < timestamp '1995-03-15 00:00:00'
AND l_shipdate > timestamp '1995-03-15 00:00:00'
GROUP BY l_orderkey, o_orderdate, o_shippriority;

In this example using VoltDB, the lineitem table is not partitioned on l_orderkey, and so this join will be processed by the RapidsDB Execution Engine. The following will be executed on each DQE node:

- Get the data from the right-side of the join by executing the following query against all partitions of the Data Store on all of the nodes:

SELECT o_orderkey, o_orderdate, o_shippriority FROM orders WHERE o_orderdate < timestamp '1995-03-15 00:00:00'

This data will be put in a hash map on the DQE using o_orderkey as the key to the hash map.

- Get the data for the left-side of the join by executing the following query against all partitions of the Data Store on the local DQE node:

SELECT l_orderkey, l_extendedprice FROM lineitem WHERE l_shipdate > timestamp '1995-03-15 00:00:00'

- Perform a lookup in the hash map based on l_orderkey to find all of the matching rows.

8. INSERT

The user can insert data into tables in the schema managed by the following Connectors:

- RapidsSE
- MemSQL
- VoltDB

The syntax for the INSERT command is:

INSERT INTO [catalog.][schema.]<table name> [(col_name,...)] VALUES (expr,...),(...),...

INSERT INTO [catalog.][schema.]<table name> [(col_name,...)] SELECT select-query
    col_name = insert_expr [, col_name = insert_expr]... ]

select-query: any valid select query as defined by section 5

NOTES:

1. The catalog and schema names are used to identify which Connector the INSERT command should be sent to. The catalog name is only needed in the situation where the schema name is not unique.
2. For an INSERT SELECT, the data types in the result set from the SELECT must match the data types for the target insert table.
3. For an INSERT SELECT, the tables specified in the INSERT and SELECT clauses can be in different schemas managed by different Connectors.
4. When inserting a date value the keyword DATE must be used to qualify the date value. For example:

INSERT INTO test.t1 VALUES (1, DATE '2016-01-01');

If the TIMESTAMP keyword is used then the following error will be returned:

Invalid timestamp (format is YYYY-MM-DD HH:MM:SS[.fffffffff])

If the date is entered with no DATE or TIMESTAMP keyword then a system exception will be returned reporting the same invalid timestamp message.

Example:

INSERT INTO test.t1 VALUES (1, 'test text', '2016-01-01 10:00:00');

The INSERT would be sent to the Connector managing the schema named test to insert data into table t1.

INSERT INTO test.t1 VALUES (1, 'test text', '2016-01-01 10:00:00'),(2, 'text', '2016-01-02 10:00:00');

The INSERT for two rows would be sent to the Connector managing the schema named test to insert data into table t1.

INSERT INTO memsql.test.t1 (c1,c2) VALUES (1, 'test text');

The INSERT would be sent to the MemSQL Connector managing the schema named test to insert data into table t1 for columns c1 and c2, with default values for any other columns in table t1.

69 INSERT INTO rapidsse.results.t1 SELECT t1, t2, t3 FROM memsql.test.t2; The INSERT would be sent to the RapidsSE Connector managing the schema named results to insert into the table t1 the results from the specified select query which was executed by the MemSQL Connector. 9. DDL The user can create and drop tables in the schema managed by the following Connectors: RapidsSE MemSQL VoltDB 9.1 CREATE TABLE The syntax for the CREATE TABLE command is: CREATE TABLE [IF NOT EXISTS] [catalog.][schema.]<table name> (create_definition,...); create_definition: col_name column_definition column_definition: data_type data_type: INTEGER[(length)] FLOAT[(length,decimals)] DECIMAL[(length[,decimals])] TIMESTAMP BOOLEAN VARCHAR(length) VARBINARY(length) NOTES: 1. The catalog and schema names are used to identify which Connector the CREATE TABLE command should be sent to. The catalog name is only needed in the situation where the schema name is not unique. 2. After creating the table, the metadata for the associated Connector will be refreshed, and there is no need to manually run the REFRESH command. 3. For the Integer, Float, Decimal, Varchar and Varbinary data types the actual size, precision and scale of the columns will be determined by the underlying data store and can be different from the value specified by the user. Column information in the COLUMNS table in the RapidsDB Page 68 Borrui Data Technology Co. Ltd 2016

70 System Metadata (see 11.8) for the created table reflects the type, size, precision and scale of columns as reported by the underlying data store and interpreted by RapidsDB. The following query can be used to see the column information for the created table: SELECT * FROM rapids.system.columns WHERE catalog_name = <catalog name for new table> AND schema_name = <schema name for new table> AND table_name = <name of new table> ; 4. When creating a table in RapidsSE, the schema to be used must have already been created in RapidsSE (refer to CREATE SCHEMA in the Installation and Management Guide). 5. The table below shows the data types supported and defaults for the data types for RapidsSE: Data Type Default INTEGER 18 digits FLOAT 53 for the mantissa DECIMAL Precision = 19, scale = 0 BOOLEAN N/A TIMESTAMP N/A VARCHAR 1MB Example: CREATE TABLE test.t1(c1 integer not null, c2 varchar(64), c3 timestamp); This command would be sent to the Connector managing the schema named test to create the table t1 CREATE TABLE memsql.test.t1(c1 integer not null, c2 varchar(64), c3 timestamp); This command would be sent to the MemSQL Connector that is managing the schema test to create the table t1 CREATE TABLE rapidsse.test.t1(c1 integer not null, c2 varchar, c3 timestamp); This command would be sent to the RapidsSE Connector to create the table t1 in the schema named test 9.2 DROP TABLE The syntax for the DROP TABLE command is: DROP TABLE [IF EXISTS] [catalog.]schema.<table name>; NOTES: Page 69 Borrui Data Technology Co. Ltd 2016

71 1. The catalog and schema names are used to identify which Connector the DROP TABLE command should be sent to. The catalog name is only needed in the situation where the schema name is not unique. 2. After dropping the table, the metadata for the associated Connector will be refreshed, and there is no need to manually run the REFRESH command. Example: DROP TABLE test.t1; This command would be sent to the Connector managing the schema named test to drop the table t1 DROP TABLE memsql.test. t1; This command would be sent to the MemSQL Connector that is managing the schema test to drop the table t1 10. REFRESH Command The REFRESH command must be used when the underlying table metadata in any Data Store has been changed, where the change was not the result of a CREATE or DROP table request that was sent from RapidsDB (see section 9) and the user wishes to access the metadata information from RapidsDB. An example would be when a new schema is created in RapidsSE, or if the user created a new table in MemSQL using the MemSQL command interface. The syntax for the Refresh command is: refresh; 11. SYSTEM METADATA TABLES 11.1 OVERVIEW RapidsDB provides a set of system metadata tables which provide metadata information about the RapidsDB system which is similar to the information provided by the ANSI Information Schema. The system metadata tables reside in the RAPIDS.SYSTEM catalog and schema. Each Federation has a METADATA Connector that maintains the system metadata tables for that Federation. The table below lists the system metadata tables: Table Name NODES FEDERATIONS CONNECTORS CATALOGS SCHEMAS TABLES Description A list of all of the nodes in the RapidsDB Cluster A list of all the Federations A list of all of the Connectors in the current Federation A list of the catalogs that can be accessed from the current Federation A list of the schemas that can be accessed from the current Federation A list of the tables that can be accessed from the current Federation. This list will provide the unique table names. If there is an ambiguous table name, then Page 70 Borrui Data Technology Co. Ltd 2016

72 COLUMNS TABLE_PROVIDERS that table name will only show up once and you will need to check the Table_Providers table to get a complete list of table names, including any duplicate names. A list of all the columns that can be accessed from the current Federation A list of all of the tables from each Connector, including any duplicates. The system metadata tables are treated the same as any other tables by RapidsDB, and as for any user tables, it is only necessary to include the catalog and/or schema name when there are multiple tables in the current Federation that use the same name. Assuming that system metadata table names are all unique within the current Federation, then the following queries will all be successful: i) select * from rapids.system.tables; ii) select * from system.tables; iii) select * from tables; 11.2 NODES Table The NODES table contains a list of the nodes in the RapidsDB Cluster. The table below shows the columns in the NODES table: Column Name NODE_NAME IS_DQC HOSTNAME CLUSTER_PORT INSTALLATION_DIR WORKING_DIR Example: Description The name assigned by the user to this node Set to true if this node is the DQC node, otherwise set to false The host name or ip address for this node The port number that this node will be listening on The installation directory for the RapidsDB Cluster software The working directory used for the RapidsDB Cluster software This example shows a 4-node RapidsDB Cluster with the node having ip address being assigned the node name NODE1 and acting as the DQC node. The other 3 nodes are the DQE nodes FEDERATIONS Table The FEDERATIONS table contains a list of the Federations in the RapidsDB Cluster. The table below shows the columns in the FEDERATIONS table: Column Name Description Page 71 Borrui Data Technology Co. Ltd 2016

73 FEDERATION_NAME IS_DEFAULT The name of the Federation. By default there will always be one Federation named DEFAULTFED. Set to true if this is the default Federation, otherwise set to false Example: 11.4 CONNECTORS Table The CONNECTORS table contains a list of the Connectors in the RapidsDB Cluster. The table below shows the columns in the CONNECTORS table: Column Name FEDERATION_NAME CONNECTOR_NAME CONNECTOR_TYPE CONNECTOR_DDL Description The name that of the Federation that this Connector belongs to The name of the Connector The type of Connector, such as VOLTDB, MEMSQL or METADATA The CREATE CONNECTOR command that was used to create this Connector The query below shows the sample output for querying this table: In this example there are 3 Connectors in the DEFAULTFED Federation: 1. METADATA manages the metadata for the DEFAULTFED Federation. 2. MEMREG a MemSQL Connector 3. STREAM1 a Stream Connector 11.5 CATALOGS Table The CATALOGS table contains a list of the catalogs in the current Federation. The table below shows the columns in the CATALOGS table: Column Name CATALOG_NAME Description The name of the catalog Page 72 Borrui Data Technology Co. Ltd 2016

The query below shows the sample output for querying this table:

In this example there are 3 catalogs:
1. STREAM - this is the catalog for the Stream Connector
2. MEMSQL - this is the catalog for the MemSQL Connector
3. RAPIDS - this is the catalog for the METADATA Connector

11.6 SCHEMAS Table

The SCHEMAS table contains a list of the schemas in the current Federation. The table below shows the columns in the SCHEMAS table:

Column Name | Description
CATALOG_NAME | The name of the catalog
SCHEMA_NAME | The name of the schema

The query below shows the sample output for querying this table:

In this example there are 3 schemas:
1. PUBLIC - this is the schema for the Stream Connector
2. SYSTEM - this is the schema for the Metadata Connector
3. REGR - this is the schema for the MemSQL Connector

11.7 TABLES Table

The TABLES table contains a list of the tables that can be accessed from the current Federation. The table below shows the columns in the TABLES table:

Column Name | Description
CATALOG_NAME | The name of the catalog for this table
SCHEMA_NAME | The name of the schema for this table
TABLE_NAME | The name of the table
IS_PARTITIONED | Set to true if this table is partitioned, otherwise it is set to false
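As with any other table, TABLES can be queried directly. For example, the following sketch (using the REGR schema from the SCHEMAS example above; the filter values are illustrative) lists the partitioned tables in that schema:

SELECT catalog_name, schema_name, table_name
FROM rapids.system.tables
WHERE schema_name = 'REGR' AND is_partitioned IS TRUE;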

11.8 COLUMNS Table

The COLUMNS table contains a list of the columns for all of the tables that can be accessed in the current Federation. The table below shows the columns in the COLUMNS table:

Column Name | Description
CATALOG_NAME | The name of the catalog for this table
SCHEMA_NAME | The name of the schema for this table
TABLE_NAME | The name of the table
COLUMN_NAME | The name of the column
DATA_TYPE | The data type for the column
ORDINAL | The column number (one-based)
IS_PARTITION_KEY | Set to true if this column is part of the partition (shard) key
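For example, the following sketch (the schema name REGR and table name T1 are illustrative) lists a table's partition (shard) key columns in column order:

SELECT column_name, data_type
FROM rapids.system.columns
WHERE schema_name = 'REGR' AND table_name = 'T1' AND is_partition_key IS TRUE
ORDER BY ordinal;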

11.9 TABLE_PROVIDERS Table

The TABLE_PROVIDERS table lists every table from every Connector, together with the Connector that provides it, including any duplicate table names. The table below shows the columns in the TABLE_PROVIDERS table:

Column Name       Description
CATALOG_NAME      The name of the catalog for this table
SCHEMA_NAME       The name of the schema for this table
TABLE_NAME        The name of the table
CONNECTOR_NAME    The name of the Connector that is managing access to this table
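Because TABLE_PROVIDERS keeps one row per Connector that exposes a given table name, it can be used to find names that are provided by more than one Connector and that therefore need to be qualified in queries. The following query is a sketch of that idea, assuming only the columns described above:

SELECT table_name, COUNT(*) AS providers
FROM table_providers
GROUP BY table_name
HAVING COUNT(*) > 1;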

12. Performance Tuning

12.1 Using EXPLAIN and STATS

The user can look at the query plan generated for a query by using EXPLAIN (see 2.1.3), and can examine the execution of the query using the STATS command (see 2.1.4).

12.2 JOIN Order

For SELECT statements with JOINs, the user should order the tables in the query from left to right so that the table with the smallest number of rows, given the supplied WHERE clause predicates, is to the left and the table with the largest number of rows is the last table. For example, with the following query:

SELECT l_orderkey, SUM(l_extendedprice) AS revenue, o_orderdate, o_shippriority
FROM customer
join orders ON c_custkey = o_custkey
join lineitem

ON l_orderkey = o_orderkey
WHERE c_mktsegment = 'FURNITURE'
  AND o_orderdate < timestamp ' :00:00'
  AND l_shipdate > timestamp ' :00:00'
GROUP BY l_orderkey, o_orderdate, o_shippriority;

If the rows returned from the customer table are the smallest and the rows returned from the lineitem table are the largest, then the query should be changed to the following in order to achieve the best query performance:

SELECT l_orderkey, SUM(l_extendedprice) AS revenue, o_orderdate, o_shippriority
FROM lineitem
join orders ON l_orderkey = o_orderkey
join customer ON c_custkey = o_custkey
WHERE c_mktsegment = 'FURNITURE'
  AND o_orderdate < timestamp ' :00:00'
  AND l_shipdate > timestamp ' :00:00'
GROUP BY l_orderkey, o_orderdate, o_shippriority;

12.3 JOIN Hints

The user can provide a hint to the query planner as to the type of join that should be used for a particular join in the query. A join hint is specified with the following syntax, choosing either BLOOM or HASH:

SELECT ... /*+{BLOOM | HASH}*/ JOIN ...

The join hint is positioned immediately before the JOIN keyword that the hint is to be applied to. Example:

SELECT l_orderkey, SUM(l_extendedprice) AS revenue, o_orderdate, o_shippriority

FROM lineitem
/*+HASH*/ join orders ON l_orderkey = o_orderkey
/*+HASH*/ join customer ON c_custkey = o_custkey
WHERE c_mktsegment = 'FURNITURE'
  AND o_orderdate < timestamp ' :00:00'
  AND l_shipdate > timestamp ' :00:00'
GROUP BY l_orderkey, o_orderdate, o_shippriority;

12.4 Restrict Amount of Data

Care should be taken to restrict the amount of data in JOINs by using appropriate predicates in the WHERE clause. Failure to do this could result in too much data being requested from the Data Store, which could cause one or more of the DQE nodes to fail by exhausting their heaps. In the previous example, the timestamps in the WHERE clause should be further restricted, for example:

WHERE c_mktsegment = 'FURNITURE'
  AND o_orderdate < timestamp ' :00:00'
  AND o_orderdate > timestamp ' :00:00'
  AND l_shipdate < timestamp ' :00:00'
  AND l_shipdate > timestamp ' :00:00'

13. Error Messages

13.1 RapidsDB shell Messages

Failed to submit statement
The RapidsDB shell program was unable to send the statement to the DQC. The likely cause is a failed node or a network problem. The bootstrapper HEALTHCHECK option (refer to the RapidsDB Installation Manual) can be used to verify that the cluster is operating normally.

System exception on node <hostname:port>: <exception>: <message>
A system exception occurred. This most likely indicates a software issue but may occur because of a network problem. Before resubmitting the query, the bootstrapper HEALTHCHECK option should be used to verify that the cluster is operating normally. (Note: system exceptions are normally followed by "traceback" information. If possible, this information should be preserved and included with any bug report.)

DQS exception on node <hostname:port>: <exception>: <message>
An exception occurred while processing the query. The query can be retried.

Subsystem exception on node <hostname:port>: <exception>: <message>
An exception occurred while processing the query. The query can be retried. The most common source of subsystem errors is exceptions from the underlying Data Store (see Data Store-related messages below).

13.2 Query Rejection Messages

In addition to the above, statements submitted to the RapidsDB shell may be rejected with any of the messages below.

Unrecognized token
A token (i.e. keyword or term) in the query was misplaced, misspelled or contained illegal characters.

Syntax error near <token>
The parser could not recognize the syntax of the statement. The last token (i.e. keyword or term) recognized is shown. Check that the structure of the statement is correct, that keywords and table names are correctly spelled, and that no tables or columns have the same name as a SQL keyword.

Syntax error at <token> expected <tokens>
The parser could not recognize the syntax of the statement. A recognizable token (i.e. keyword or term) was encountered, but it was not one of the tokens expected. Check that the structure of the statement is correct, that keywords and table names are correctly spelled, and that no tables or columns have the same name as a SQL keyword.

Reference to undefined table or stream: <name>
The statement refers to a table name that is neither a table in the database nor a properly defined table or subquery alias within the query. Check that all table names or aliases are correctly spelled.

Reference to undefined column: <name>
The statement refers to a variable or column name that is not defined in any of the tables or subqueries in the statement. Check that column names are correctly spelled.

Ambiguous reference to table or stream: <name>
The statement refers to a table name that is defined in more than one catalog and schema. The table name must be qualified with either the schema name or the catalog and schema name to disambiguate it.

Ambiguous reference to column: <name>
The statement refers to a variable or column name that is defined in more than one table or subquery. Qualify the column name with a table name or a table or subquery alias.

Type mismatch in <name>
The arguments to the named operator or function were of the wrong type. For details on the number and type of function arguments, see 3.2 (operators and functions).

Wrong number of arguments for function: <name>
The wrong number of arguments was supplied for the named function. For details on the number and type of function arguments, see 3.2 (operators and functions).

Unsupported operator or function: <name>
The named operator or function is valid in SQL but is not supported in DQS.

JOIN predicate must be boolean expression
The expression in the ON clause of a join must be of boolean type (i.e. must produce a true or false result).

WHERE predicate must be boolean expression
The expression in the WHERE clause must be of boolean type (i.e. must produce a true or false result).

FULL OUTER join not supported
RapidsDB does not support full outer joins. Only inner joins and left or right outer joins are supported.

Invalid GROUP BY expression
The expression specified in the GROUP BY clause contains an aggregate function (COUNT, MIN, MAX, SUM or AVG). Only scalar operators and functions can be used in GROUP BY.

Expression not in aggregate or GROUP BY column
In a query or subquery that specifies aggregation (COUNT, MIN, MAX, SUM or AVG functions) and/or grouping (GROUP BY clause), each expression in the SELECT list must be either an aggregate or a duplicate of one of the GROUP BY expressions. In other words, each SELECT expression must produce only a single value per group (or, if there is no grouping, a single value for the entire query).

Invalid column index in ORDER BY
The ORDER BY <column number> clause was used but the specified column number is greater than the number of columns produced by the query or subquery.

LIMIT value must not be negative
The LIMIT clause specified a negative limit. The value in the LIMIT clause must be zero or positive.

OFFSET value must not be negative
The OFFSET clause specified a negative offset. The value in the OFFSET clause must be zero or positive.

Select list count mismatch
The statement contains a UNION clause in which the query on the left side produces a different number of columns than the query on the right side. The queries on either side of a UNION must produce a matching number of columns.

Select list type mismatch
The statement contains a UNION clause in which the query on the left side produces columns whose types are incompatible with the columns produced by the query on the right side. The queries on either side of a UNION must produce columns with matching types. Any numeric type is considered a match for any other numeric type. All other types must match exactly.

EXCEPT not supported
The DQS does not support the EXCEPT clause.

INTERSECT not supported
The DQS does not support the INTERSECT clause.
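For the two "Ambiguous reference" messages above, the usual fix is to qualify the name as described in the naming rules earlier in this guide. The following sketches are illustrative only; the MEMSQL catalog, REGR schema and TPC-H style table and column names are examples, not part of the messages.

To qualify an ambiguous table name with its schema, or catalog and schema:

SELECT o_orderkey FROM memsql.regr.orders WHERE o_custkey = 1;

To qualify an ambiguous column name with a table name or alias:

SELECT o.o_orderdate, c.c_mktsegment
FROM orders o JOIN customer c ON o.o_custkey = c.c_custkey;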

13.3 Data Store-Related Messages

Note: messages related to the underlying Data Store may occur as either DQS Exceptions or Subsystem Exceptions (see above).

Timed out waiting for queries sent to VoltDB to complete
A query that was sent to the Data Store took an excessive amount of time to produce a result. The query can be retried. This may indicate that the Data Store is too heavily loaded on one or more hosts. If the error recurs, consider reducing the number of concurrent queries (to reduce the number of queries that are concurrently executed by RapidsDB, change the -q option passed to the bootstrapper; see the RapidsDB Installation and Management Manual).

No connections.
The Data Store has disconnected from one or more DQS nodes due to inactivity. The query can be retried.

SQL ERROR More than nnn MB of temp table memory used while executing SQL.
The intermediate results of the query are too large for the internal tables in the Data Store. You can adjust the maximum size of the internal tables in the Data Store deployment file to accommodate the query. (For more information, see the RapidsDB Installation and Management Manual.)

SQL ERROR Output from SQL stmt overflowed output/network buffer of 50mb
The final result of the query is too large for the Data Store output buffers. You may be able to limit the size of the result using the LIMIT clause, or break the query into sections using the LIMIT, OFFSET and UNION clauses.
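As a sketch of the LIMIT/OFFSET workaround mentioned above (the 100000-row section size and the lineitem columns are only illustrative), a large result can be fetched in fixed-size sections by re-running the query with an increasing OFFSET; an ORDER BY is needed so that the sections do not overlap:

SELECT l_orderkey, l_extendedprice
FROM lineitem
ORDER BY l_orderkey
LIMIT 100000 OFFSET 0;

SELECT l_orderkey, l_extendedprice
FROM lineitem
ORDER BY l_orderkey
LIMIT 100000 OFFSET 100000;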

Appendix A Sample Scripter Program

// Copyright 2014 Boray Data Co. Ltd. All rights reserved.
package com.rapidsdata.cli;

import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.PrintWriter;

/**
 * An example implementation of how to script the RapidsDB shell from another program.
 * In the following, the term CLI refers to the RapidsDB shell.
 * The RapidsDB shell can be launched with the following command (assuming the current
 * working directory is the "current" directory where RDP is installed,
 * usually /opt/rdp/current):
 *
 *   java -cp .:./bin/*:./lib/* com.rapidsdata.cli.Scripter "./rapids-shell.sh -s -c cfg/cluster.config"
 *
 * Note the double quotes around the command to run the RapidsDB shell. This Scripter
 * application expects the first argument to be the complete command to run the RapidsDB shell.
 *
 * If you are not running from the "current" directory in the RDP installation
 * (e.g., if you are making your own version of Scripter) then the classpaths
 * need to be modified in order to find all the dependencies.
 * Assuming that a shell variable points to the installation directory of RDP
 * (e.g., RDP_DIR=/opt/rdp/current) then it might look something like this:
 *
 *   java -cp .:$RDP_DIR:$RDP_DIR/bin/*:$RDP_DIR/lib/* com.rapidsdata.cli.Scripter
 *       "java -cp $RDP_DIR:$RDP_DIR/bin/*:$RDP_DIR/lib/* com.rapidsdata.cli.CLI -s -c $RDP_DIR/cfg/cluster.config"
 */
public class Scripter
{
    /** The command that launches the CLI. */
    private String cliCommand;

    /** An array of commands to execute. */
    private String[] commands;

    /** The CLI sub-process. */
    private Process cli;

    /** The stdin stream of the CLI we can write to. */
    private PrintWriter cliIn;

    /** The stdout stream of the CLI we can read from. */
    private BufferedReader cliOut;

    /** The stderr stream of the CLI we can read from. */
    private BufferedReader cliErr;

    /** Has the CLI exited yet? */
    private boolean cliExited;

    /** How many rows have we processed? */
    private int rowsProcessed;

    /**
     * The main entry point of the program.
     * @param args command line arguments.
     */
    public static void main(String[] args) throws IOException, InterruptedException
    {
        if (args.length < 1) {
            System.err.println("Must specify at least one argument.");
            printUsage();
            System.exit(1);
        }

        // Argument[0] is the command to launch the RapidsDB shell.
        // e.g., "java -cp $RDP_DIR:$RDP_DIR/bin/*:$RDP_DIR/lib/* com.rapidsdata.cli.CLI -s -c cfg/cluster.config"
        // where:
        //   $RDP_DIR is the path to the RDP installation directory (usually /opt/rdp/current)
        //   -s = run in scripting mode
        //   -c = path to the cluster config file.
        String[] commands = {
            "SELECT * FROM part JOIN partsupp ON part.p_partkey = partsupp.ps_partkey WHERE part.p_size > 25;",
            "SELECT c_custkey + 1 FROM customer where c_custkey = 1;",
            "SELECT error FROM nonexistanttable;"
        };

        Scripter scripter = new Scripter(args[0], commands);
        scripter.executeAll();
    }

    /**
     * Print the usage of this program's arguments to stderr.
     */
    public static void printUsage()
    {
        System.err.println();
        System.err.println("Arguments to this application:");
        System.err.println("  arg0 = CLI command line");
    }

    /**
     * Constructor
     * @param cliCommand The command that launches the CLI.
     */
    public Scripter(String cliCommand, String[] commands)
    {
        this.cliCommand = cliCommand;
        this.commands = commands;
    }

    /**
     * Execute all SQL commands given.
     */
    public void executeAll() throws IOException, InterruptedException
    {
        for (String command : commands) {
            reset();
            execute(command);
        }
    }

    /**
     * Resets the relevant member variables for the next command.
     */
    private void reset()
    {
        cli = null;
        if (cliIn != null) {
            cliIn.close();
            cliIn = null;
        }
        if (cliOut != null) {
            try { cliOut.close(); } catch (IOException e) { /* do nothing */ }
            cliOut = null;
        }
        if (cliErr != null) {
            try { cliErr.close(); } catch (IOException e) { /* do nothing */ }
            cliErr = null;
        }
        cliExited = false;
        rowsProcessed = 0;
    }

    /**
     * Executes a single SQL command via the CLI.
     * @param command The SQL command to execute.
     */
    private void execute(String command) throws InterruptedException, IOException
    {
        // Start the CLI
        startCli();

        System.out.println("\nExecuting the command: " + command + "\n");

        // Send the command down the CLI's stdin.
        // We must send a CR character as the CLI reads whole lines at a time.
        // Don't forget to flush the stream either, otherwise the CLI
        // won't get it.
        cliIn.print(command + "\n");
        cliIn.flush();

        // While the CLI is running...
        while (cliExited == false) {
            if (cliOut.ready() == false && cliErr.ready() == false) {
                // Nothing to read yet, so sleep for a bit.
                Thread.sleep(10);
                continue;
            }

            // Read from the CLI's stdout and stderr.
            processStdout();
            processStderr();
        }

        // Read any final data from the CLI
        processStdout();
        processStderr();

        System.out.println();
        System.out.println(rowsProcessed + " rows processed.");

        // Read the CLI's error code.
        int errorCode = cli.exitValue();
        System.out.println("CLI exited with error code " + errorCode);
    }

    /**
     * Starts the CLI process and gets its stdin/stdout/stderr streams.
     */
    private void startCli()
    {
        try {
            // Start the CLI process.
            cli = Runtime.getRuntime().exec(cliCommand);
        } catch (IOException e) {
            System.err.println("Error starting the CLI: " + e.getMessage());
            e.printStackTrace(System.err);
            System.exit(1);
        }

        // Now get the CLI's stdin, stdout and stderr that we can read and write to.
        cliIn = new PrintWriter(cli.getOutputStream());
        cliOut = new BufferedReader(new InputStreamReader(cli.getInputStream()));
        cliErr = new BufferedReader(new InputStreamReader(cli.getErrorStream()));

        // Start a thread to wait for the CLI to exit and set a flag when this occurs.
        new Thread(new ProcessExitWatcher()).start();
    }

    /**
     * Process anything to read from the CLI's stdout.
     * @throws IOException
     */
    private void processStdout() throws IOException
    {
        while (cliOut.ready()) {
            String rowString = cliOut.readLine();
            System.out.println("Read row: [" + rowString + "]");
            ++rowsProcessed;
        }
    }

    /**
     * Process anything to read from the CLI's stderr.
     * @throws IOException
     */
    private void processStderr() throws IOException
    {
        if (cliErr.ready()) {
            char[] errArray = new char[1024];
            String errString = "";
            int charsRead = -1;
            do {
                // read as many characters as possible into the char array.
                charsRead = cliErr.read(errArray, 0, errArray.length);

                // concatenate to the existing string read in.
                if (charsRead > 0) {
                    errString += new String(errArray, 0, charsRead);
                }
                // repeat if we read a full array of characters.
            } while (charsRead == errArray.length);

            System.err.println("Read the error: " + errString);
        }
    }

    /**
     * Nested class that simply waits for the CLI to exit and then sets a flag indicating such.
     */
    private class ProcessExitWatcher implements Runnable
    {
        /**
         * {@inheritDoc}
         */
        public void run()
        {
            try {
                // wait for the CLI to exit.
                cli.waitFor();
            } catch (InterruptedException e) {
                e.printStackTrace();
            }

            // now set a flag in the outer class to indicate that the CLI has exited.
            cliExited = true;
        }
    }
}

Appendix B SQL Grammar

