Simple SELECT statement

SELECT statement retrieves rows from the database and has the most complex structure among other SQL statements. Almost any database user is capable of writing a simplest SELECT statement such as

SELECT * FROM PC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

which retrieves all records from the table-type object PC; in so doing rows and columns of the result set have no order. To order columns of the result set they should be listed and separated by commas in the required order after the SELECT keyword:

SELECT price, speed, hd, ram, cd, model, code
FROM PC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

Here is the result set of this query.

price	speed	hd	ram	Cd	model	code
600	500	5	64	12x	1232	1
850	750	14	128	40x	1121	2
600	500	5	64	12x	1233	3
850	600	14	128	40x	1121	4
850	600	8	128	40x	1121	5
950	750	20	128	50x	1233	6
400	500	10	32	12x	1232	7
350	450	8	64	24x	1232	8
350	450	10	32	24x	1232	9
350	500	10	32	12x	1260	10
980	900	40	128	40x	1233	11

The vertical projection of the РC table is obtained by listing the necessary fields only. For example, to get information about the processor speed and the amount of RAM in the computer run the following query:

SELECT speed, ram
FROM PC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

which returns the data:

speed	ram
500	64
750	128
500	64
600	128
600	128
750	128
500	32
450	64
450	32
500	32
900	128

It should be noted that a vertical sample may include duplicate rows in case where the sample does not include any potential key with the values uniquely identify each row in the table. In the PC table, the code field is a potential key, which is specified in addition as primary key. Since this field is not included in the query, there are listed some duplicate rows in the above result set (for example, rows 1 and 3). If unique rows are needed (say, we only need different combinations of processor speed and RAM amount, not specifications of all available PCs), use the DISTINCT keyword:

SELECT DISTINCT speed, ram
FROM PC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

Here’s the result set:

speed	ram
450	32
450	64
500	32
500	64
600	128
750	128
900	128

Apart from DISTINCT, the ALL keyword, which explicitly ask for all rows, may also be applicable. However, ALL keyword is accepted by default.

It is possible to sort out the result set by a number of columns pointed out in the SELECT statement. For this purpose, the clause ORDER BY <list of fields> is used which is always the latest clause in the SELECT statement. In so doing, the sort column in list of fields may be specified as a name or a non negative integer representing the position of the name in SELECT list. For example, to sort the result set by RAM in descending order we can write

SELECT DISTINCT speed, ram
FROM PC
ORDER BY ram DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

SELECT DISTINCT speed, ram
FROM PC
ORDER BY 2 DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

The following result is the same for both above queries.

speed	ram
600	128
750	128
800	128
900	128
450	64
500	64
450	32
500	32

The result set can be sorted in ascending order (ASC is assumed by default) or in descending order (DESC keyword).

Note

It is not recommended to use in applications the queries with sorting by numbers of columns. This is connected with the fact that the structure of a table can change over time, for example, as a result of addition/removal of columns. As consequence, the following query

SELECT *
FROM PC
ORDER BY 3;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

can give absolutely another sequence or generally cause an error, referring to an absent column.

Sorting by two columns

SELECT DISTINCT speed, ram
FROM PC
ORDER BY ram DESC, speed DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

gives the following result:

speed	ram
900	128
800	128
750	128
600	128
500	64
450	64
500	32
450	32

Horizontal restriction is realized by the clause WHERE <predicate> after the FROM clause. Now the result set will only include the rows from the record source for each of those the predicate returns TRUE. In other words, the predicate for each row is checked . For example, the query “get information about processor’s speed and RAM amount for computers priced below $500” can be written as follows:

SELECT DISTINCT speed, ram
FROM PC
WHERE price < 500
ORDER BY 2 DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

speed	Ram
450	64
450	32
500	32

The latter query uses a comparison predicate with operator “<” (less than). Beside this operator, the following operators may be used: “=” (equal), “>” (greater than), “>=” (greater or equal), “<=” (less or equal) and “<>” (not equal). Expressions in comparison predicates may include any columns from the tables listed in the FROM clause. Character strings and date/time constants are enclosed in single quotation marks.

Here are some examples of simple comparison predicates:

Predicate	Description
price < 1000	Price is less than 1000
type = ‘laptop’	Product type is Laptop
cd = ‘24x’	24-speed CD-ROM
color <> ’y’	Not-color printer
ram – 128 > 0	RAM amount is over 128 Mb
Price <= speed*2	Price does not exceed twice processor’s speed

Sorting can be accomplished by the columns absent from SELECT column-list. Naturally, these columns should be presented in the output of FROM clause. For example, to deduce the model list of PCs in the order from greatest price to lowest one, you can write

select model from PC
order by price DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

Notice that the price itself does not be returned by the query. Elimination of duplicates produces ambiguous situation that prevents the behaviour. Thus, the query

select DISTINCT model from PC
order by price DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

gives us the error yet:

ORDER BY items must appear in the select list if SELECT DISTINCT is specified.

The same reason prevents from unerror working of the following query that uses grouping

select model from PC
group by model
order by price DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

Column "PC.price" is invalid in the ORDER BY clause because it is not contained in either an aggregate function or the GROUP BY clause.

But if you eject ambiguity (i.e. to do sorting by an aggregate-function value for a group), the query will work:

select model from PC
group by model
order by MAX(price) DESC;

🚫

[[ error ]]

[[ column ]]
NULL [[ value ]]

Note

All the query examples (including erroneous ones) will work in MySQL, which eliminates ambiguity by itself. Do you want to know how? Look in MySQL documentation. :-)

Suggested exercises: 1, 2, 3, 4, 5, 6, 9, 14, 31, 33, 42.

#SELECT Statement #DISTINCT #ORDER BY #Comparison Predicates #GROUP BY #Sorting

Sorting in order of days of birth

speed	ram
500	64
750	128
500	64
600	128
600	128
750	128
500	32
450	64
450	32
500	32
900	128

speed	ram
500	64
750	128
500	64
600	128
600	128
750	128
500	32
450	64
450	32
500	32
900	128

speed	ram
500	64
750	128
500	64
600	128
600	128
750	128
500	32
450	64
450	32
500	32
900	128