Gnumeric provides two types of frequencies tables:
The frequency tool can be used to create frequency tables for non-numerical data. It presents this table numerically as well as graphically.
If your data are numeric and you want to accumulate whole intervals of values into frequency counts then this tool is not appropriate. In that case you may want to use the histogram table tool described in section Section 8.2.4.2 ― Histogram Tool.
As shown in Figure 8-15, the frequency table dialog has four tabs. We will introduce them in sequence.
The “Input” tab shown in Figure 8-15 contains the field specifying the data to be used for the histogram.
The “Input Range” entry contains a single range or a list of ranges, that can be grouped into variables by rows, columns, or areas.
If the first row or column of the given input ranges, or the first field of each area contains labels, the “Labels” option should be selected. If the input is grouped by areas and the top left cell contains a label, the other cells in the first row are being ignored.
The “Categories” tab permits the specification of a range that contains the possible values that are supposed to be counted in the input range.
The “automatic categories” option is disabled since it is not yet implemented.
The “Graphs & Options” tab allows various options to be set. In the top half of the tab you can choose whether you would like a graph to be created. If you choose to have a graph created you can specify whether you would like to see a bar chart or a column chart.
In the bottom part of the tab you can select the “percentages” option. This option replaces the frequency counts with percentages.
If the categories range contains repeated values, then the percentages may add up to more than 100%. If the the categories range does not contain all values that occur in the input range, the percentages may sum to less than 100%.
The “Use exact comparisons” checkbox determines how category values and input range values are compared. If it is checked then the function EXACT is used for the comparison. If it isn't checked then simple equality is used. In this latter case, empty cells and cells containing the numerical value 0 are considered equal. As a consequence you usually want that checkbox to be selected.
The histogram tool can be used to create histograms or frequency tables for numerical data. Using this tool you can define intervals, or “bins”. The tool determines how many data points belong to each bin and presents this number numerically as well as graphically.
If your data are non-numeric this tool is not appropriate. In that case you may want to use the frequency table tool described in section Section 8.2.4.1 ― Frequency Tables Tool.
As shown in Figure 8-19, the histogram dialog has five tabs. We will introduce them in sequence.
The “Input” tab shown in Figure 8-19 contains the field specifying the data to be used for the histogram.
The “Input Range” entry contains a single range or a list of ranges, that can be grouped into variables by rows, columns, or areas.
If the first row or column of the given input ranges, or the first field of each area contains labels, the “Labels” option should be selected. If the input is grouped by areas and the top left cell contains a label, the other cells in the first row are being ignored.
The cutoffs for the histogram can either be predetermined by data contained in your workbook or calculated by the histogram tool. These cutoffs determine bins as defined by the selection on the “Bins” tab.
Select the “Predetermined Cutoffs” option to specify data on your worksheet in the “Cutoff Range:” entry. The values in this range will be used as cutoffs c1, c2, and so on to cn.
Select the “Calculated Cutoffs” option to have the cutoffs determined by the tool. Enter the desired number of cutoffs in the “Number of Cutoffs” entry. It is strongly recommended (but optional) that you specify the minimum and maximum cutoffs in the “Minimum cutoff” and “Maximum cutoff” entries. If the minimum or maximum cutoff is not specified, the tool will use the minimum and/or maximum of the current data.
The bins tab is used to determine how the cutoffs c1, c2, and so on to cn are translated into bins. Specifically, it has to be determined whether first and/or last bins reaching from −∞ to c1 and from cn to ∞ are added and whether data points that much cutoffs exactly are included in the bin to the right or the left.
For example the option “[∙,∙),[∙,∙),⋯, [∙,∙),[∙,∞) ” indicates that the first bin starts at the first cutoff while the last bin ends at ∞. Moreover, each cutoff value belongs to the bin on its right.
The options in the graphs and options tab specify any graph to be created and modify the appearance of the histogram:
The “No chart” option causes the chart to be omitted.
The “Bar chart” option causes a bar chart to be added to the histogram. For each bin, the bar chart shows a horizontal bar indicating the frequency.
The “Column chart” option causes a column chart to be added to the histogram. For each bin, the column chart shows a vertical bar indicating the frequency.
The “Histogram chart” option causes a histogram chart to be added to the histogram. For each bin, the histogram chart shows a vertical bar indicating the density (that is the frequency divided by the width of the bin).
The “Percentages” option causes the frequencies to be expressed as percentages.
The “Cumulative answers” option causes a cumulative frequency table (either with counts or with pecentages) to be created.
The “Count numbers only” option determines whether only numbers are counted. If also non-numbers are counted they are first converted into numbers, usually into 0.
The Output tab contains the standard output options and fields described in Section 8.1 ― Overview.
For example, you want to calculate a histogram for the number of successes in several sequences of trials. The numbers of successes are recorded in column A and the cutoffs of interest in column C (see Figure 8-23).
The results are shown in Figure 8-27. Note that the graph will by default appear on top of the histogram table. It usually needs to be moved in to proper position. That has already been done here.