VegBranch Normalizer


Normalizer

The Normalizer is included in VegBranch versions beta 4.1 and higher. You can also download it as a stand-alone MS Access database with the above link.

The Normalizer takes a denormalized .csv file and normalizes it. You can specify criteria for this normalization, or let it AutoDetect them. In order to understand this, you must first understand what denormalized and normalized data are.

Normalized data have only one field (column) for each data field; the same data field never appears in multiple columns. This is "standard" database architecture, but users often have applications that require denormalized data, which are often more compact and can be more useful in specific situations.

Denormalized data spread the same data field across multiple columns, with each column headed by a different value of a separate field. For example, if you have many columns headed by species names, with the cover value listed below each, that is a denormalized table (see example 1 below). The normalized table would have the species name in one column and the cover value in the next column.

The method of labelling the denormalized ID columns to the left, the ID rows above, and the data cells themselves is my own convention. ID columns are labelled above, ID rows are labelled to the left, and where all three labels collide, a single cell holds all three, using a pipe symbol, |, to separate the column label, row label, and data cell label, respectively.

How normalization works:

Combination of first plot and first species : row 1

	Species A	Species B
Plot 1	45%
Plot 2	33%	18%

Plot 1	Species A	45%

Combination of first plot and second species : row 2

	Species A	Species B
Plot 1	45%
Plot 2	33%	18%

Plot 1	Species A	45%
Plot 1	Species B

Combination of second plot and first species : row 3

	Species A	Species B
Plot 1	45%
Plot 2	33%	18%

Plot 1	Species A	45%
Plot 1	Species B
Plot 2	Species A	33%

Combination of second plot and second species : row 4

	Species A	Species B
Plot 1	45%
Plot 2	33%	18%

Plot 1	Species A	45%
Plot 1	Species B
Plot 2	Species A	33%
Plot 2	Species B	18%
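
The walk-through above can be sketched in a few lines of Python. This is only an illustration of the idea, not the Normalizer's actual code (which lives in an MS Access database); the function name `normalize_simple` and the inline sample data are my own assumptions. It handles only the simplest layout: one ID row on top and one ID column on the left.

```python
import csv
import io

def normalize_simple(rows):
    """Normalize a grid with one ID row (top) and one ID column (left).

    Hypothetical helper for illustration only.
    """
    header, *body = rows
    top_values = header[1:]              # e.g. species names
    out = []
    for row in body:
        left_value = row[0]              # e.g. plot name
        cells = row[1:]
        for i, top in enumerate(top_values):
            # A missing trailing cell is treated as an empty cell.
            cell = cells[i] if i < len(cells) else ""
            out.append((left_value, top, cell))
    return out

# The two-plot, two-species table used in the walk-through, as .csv text.
denormalized = ",Species A,Species B\nPlot 1,45%,\nPlot 2,33%,18%\n"
grid = list(csv.reader(io.StringIO(denormalized)))
normalized = normalize_simple(grid)

for record in normalized:
    print("\t".join(record))
```

Each combination of plot and species becomes one output row, in the same order as rows 1 to 4 above, and an empty cell still produces a row with an empty value.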

Examples

Download the following examples as .csv files, plus a stems example here.

Example 1: Simple example, plots by species with cover values:

(1 ID row, 1 ID column, no labels)

	Species A	Species B	Species C	Species D
Plot 1	45%		2%
Plot 2	33%	18%		44%
Plot 3		7%		41%
Plot 4	2%		87%	8%

Key to colors

NORMALIZED output:

unknownLeftLabel1	unknownTopLabel1	unknownCellDataLabel
Plot 1	Species A	45%
Plot 1	Species B
Plot 1	Species C	2%
Plot 1	Species D
Plot 2	Species A	33%
Plot 2	Species B	18%
Plot 2	Species C
Plot 2	Species D	44%
Plot 3	Species A
Plot 3	Species B	7%
Plot 3	Species C
Plot 3	Species D	41%
Plot 4	Species A	2%
Plot 4	Species B
Plot 4	Species C	87%
Plot 4	Species D	8%

Example 2: same as example 1, except includes a complex label for field names:

Plot|Spec|CoverPercent

The complex label consists of three parts: 1) the label for the data that appear in the left column, 2) the label for the data in the first row, above the cover data, and 3) the label for the intersection cells, here cover percent.
These parts are separated by a | in the .csv file. This essentially combines three cells into one.
The three cells into which the complex label could be divided would be colored like this:

Plot	Spec	CoverPercent
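
Splitting a complex label into its three parts is simple. The sketch below is an illustration only; the function name `split_complex_label` is hypothetical, and returning None for a cell that does not contain exactly three parts is my assumption, not documented Normalizer behaviour.

```python
def split_complex_label(cell):
    """Split 'left|top|data' into its three labels.

    Returns None when the cell is not a three-part complex label
    (an assumption made for this sketch).
    """
    parts = cell.split("|")
    if len(parts) != 3:
        return None
    left_label, top_label, data_label = (p.strip() for p in parts)
    return left_label, top_label, data_label

labels = split_complex_label("Plot|Spec|CoverPercent")
print(labels)
```

For the label used in this example, the three parts are Plot (left column), Spec (top row), and CoverPercent (intersection cells).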

(1 ID row, 1 ID column, labels)

Plot|Spec|CoverPercent	Species A	Species B	Species C	Species D
Plot 1	45%		2%
Plot 2	33%	18%		44%
Plot 3		7%		41%
Plot 4	2%		87%	8%

Key to colors
NORMALIZED output:

Plot	Spec	CoverPercent
Plot 1	Species A	45%
Plot 1	Species B
Plot 1	Species C	2%
Plot 1	Species D
Plot 2	Species A	33%
Plot 2	Species B	18%
Plot 2	Species C
Plot 2	Species D	44%
Plot 3	Species A
Plot 3	Species B	7%
Plot 3	Species C
Plot 3	Species D	41%
Plot 4	Species A	2%
Plot 4	Species B
Plot 4	Species C	87%
Plot 4	Species D	8%

Example 3: A bit more complex. Two data fields above and to the left of the cover data, no field labels:

(2 ID rows, 2 ID columns, no labels)

		fern	tree	tree	shrub
		Species A	Species B	Species C	Species D
steep	Plot 1	45%		2%
very steep	Plot 2	33%	18%		44%
flat	Plot 3		7%		41%
flat	Plot 4	2%		87%	8%

Key to colors
NORMALIZED output:

unknownLeftLabel1	unknownLeftLabel2	unknownTopLabel1	unknownTopLabel2	unknownCellDataLabel
steep	Plot 1	fern	Species A	45%
steep	Plot 1	tree	Species B
steep	Plot 1	tree	Species C	2%
steep	Plot 1	shrub	Species D
very steep	Plot 2	fern	Species A	33%
very steep	Plot 2	tree	Species B	18%
very steep	Plot 2	tree	Species C
very steep	Plot 2	shrub	Species D	44%
flat	Plot 3	fern	Species A
flat	Plot 3	tree	Species B	7%
flat	Plot 3	tree	Species C
flat	Plot 3	shrub	Species D	41%
flat	Plot 4	fern	Species A	2%
flat	Plot 4	tree	Species B
flat	Plot 4	tree	Species C	87%
flat	Plot 4	shrub	Species D	8%

Example 4: Same as example 3, but added field labels:

(2 ID rows, 2 ID columns, labels)

	SpeciesType	fern	tree	tree	shrub
PlotSlope	PlotID|Species|Cover	Species A	Species B	Species C	Species D
steep	Plot 1	45%		2%
very steep	Plot 2	33%	18%		44%
flat	Plot 3		7%		41%
flat	Plot 4	2%		87%	8%

Key to colors
NORMALIZED output:

PlotSlope	PlotID	SpeciesType	Species	Cover
steep	Plot 1	fern	Species A	45%
steep	Plot 1	tree	Species B
steep	Plot 1	tree	Species C	2%
steep	Plot 1	shrub	Species D
very steep	Plot 2	fern	Species A	33%
very steep	Plot 2	tree	Species B	18%
very steep	Plot 2	tree	Species C
very steep	Plot 2	shrub	Species D	44%
flat	Plot 3	fern	Species A
flat	Plot 3	tree	Species B	7%
flat	Plot 3	tree	Species C
flat	Plot 3	shrub	Species D	41%
flat	Plot 4	fern	Species A	2%
flat	Plot 4	tree	Species B
flat	Plot 4	tree	Species C	87%
flat	Plot 4	shrub	Species D	8%

Example 5: more complicated example (differing number of left and top fields, includes labels):

(4 ID rows, 3 ID columns, labels)

		SpecSynonym				Species RR
		SpeciesCode	ACE1	RGE3	EDGR	SHe2
		SpeciesType	fern	tree	tree	shrub
PlotSize	PlotSlope	PlotID|Species|Cover	Species A	Species B	Species C	Species D
44	steep	Plot 1	45%		2%
50	very steep	Plot 2	33%	18%		44%
80	flat	Plot 3		7%		41%
100	flat	Plot 4	2%		87%	8%

Key to colors

NORMALIZED output:

PlotSize	PlotSlope	PlotID	SpecSynonym	SpeciesCode	SpeciesType	Species	Cover
44	steep	Plot 1		ACE1	fern	Species A	45%
44	steep	Plot 1		RGE3	tree	Species B
44	steep	Plot 1		EDGR	tree	Species C	2%
44	steep	Plot 1	Species RR	SHe2	shrub	Species D
50	very steep	Plot 2		ACE1	fern	Species A	33%
50	very steep	Plot 2		RGE3	tree	Species B	18%
50	very steep	Plot 2		EDGR	tree	Species C
50	very steep	Plot 2	Species RR	SHe2	shrub	Species D	44%
80	flat	Plot 3		ACE1	fern	Species A
80	flat	Plot 3		RGE3	tree	Species B	7%
80	flat	Plot 3		EDGR	tree	Species C
80	flat	Plot 3	Species RR	SHe2	shrub	Species D	41%
100	flat	Plot 4		ACE1	fern	Species A	2%
100	flat	Plot 4		RGE3	tree	Species B
100	flat	Plot 4		EDGR	tree	Species C	87%
100	flat	Plot 4	Species RR	SHe2	shrub	Species D	8%
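
The general case, with any number of ID rows and ID columns, can be sketched as follows. This is a rough illustration inferred from the examples on this page, not the Normalizer's real implementation. The function name `normalize` and its parameters `n_id_rows` and `n_id_cols` are hypothetical, and falling back to an "unknown..." name for each individual missing label is an assumption.

```python
def normalize(grid, n_id_rows, n_id_cols):
    """Normalize a grid with n_id_rows ID rows on top and n_id_cols ID
    columns on the left. Returns (header, records). Illustration only."""
    width = max(len(r) for r in grid)
    # Pad short rows so that missing trailing cells become empty cells.
    grid = [list(r) + [""] * (width - len(r)) for r in grid]

    # The last ID row holds the left labels and, in the last ID column,
    # the complex label "left|top|data".
    corner_row = grid[n_id_rows - 1]
    last = n_id_cols - 1
    parts = corner_row[last].split("|")
    if len(parts) == 3:
        complex_left, complex_top, data_label = (p.strip() for p in parts)
    else:
        complex_left = complex_top = data_label = ""

    left_labels = corner_row[:last] + [complex_left]
    # Top labels sit in the last ID column, to the left of each ID row.
    top_labels = [grid[r][last] for r in range(n_id_rows - 1)] + [complex_top]

    # Assumption: each missing label gets a default "unknown..." name.
    header = [lab or f"unknownLeftLabel{i}" for i, lab in enumerate(left_labels, 1)]
    header += [lab or f"unknownTopLabel{i}" for i, lab in enumerate(top_labels, 1)]
    header.append(data_label or "unknownCellDataLabel")

    records = []
    for row in grid[n_id_rows:]:
        left_data = row[:n_id_cols]
        for c in range(n_id_cols, width):
            top_data = [grid[r][c] for r in range(n_id_rows)]
            records.append(left_data + top_data + [row[c]])
    return header, records

# Example 5 from this page, written as a list of rows.
example5 = [
    ["", "", "SpecSynonym", "", "", "", "Species RR"],
    ["", "", "SpeciesCode", "ACE1", "RGE3", "EDGR", "SHe2"],
    ["", "", "SpeciesType", "fern", "tree", "tree", "shrub"],
    ["PlotSize", "PlotSlope", "PlotID|Species|Cover",
     "Species A", "Species B", "Species C", "Species D"],
    ["44", "steep", "Plot 1", "45%", "", "2%"],
    ["50", "very steep", "Plot 2", "33%", "18%", "", "44%"],
    ["80", "flat", "Plot 3", "", "7%", "", "41%"],
    ["100", "flat", "Plot 4", "2%", "", "87%", "8%"],
]
header, records = normalize(example5, n_id_rows=4, n_id_cols=3)
print("\t".join(header))
for record in records:
    print("\t".join(record))
```

Calling the same function with n_id_rows=1 and n_id_cols=1 on a grid without labels yields the default header names shown in example 1.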

Color Key:

color and name	explanation
empty cell	or unknown column label derived from an empty cell in normalized table.
Left Label	labels the "left data" column that appears below it.
Top Label	labels the "top data" row that appears to its right.
Complex label	consists first of the last "left label", then "|", then the last "top label", then "|", then the "cell data" label.
Left Data (ID columns)	data that applies to all "cell data" in the same row.
Top Data (ID rows)	data that applies to all "cell data" in the same column.
Cell data	linked to the "top data" in same column and "left data" in same row.
Highlighted cell	illustrates something.
