The Dendrogrammer
Dendrogram  Demo data
Look under the "Help" tab above for the Quick-start guide.

  

Load different data

Load a different data set by clicking one of the links in the table below.
eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-cityblock.dat
eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.dat
eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrTerm-single-cityblock.dat
eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrTerm-single-euclidean.dat
eciQc3pt1FullE2kExRjctdUnfilteredClstrLocn-single-cityblock.dat
eciQc3pt1FullE2kExRjctdUnfilteredClstrLocn-single-euclidean.dat
eciQc3pt1FullE2kExRjctdUnfilteredClstrTerm-single-cityblock.dat
eciQc3pt1FullE2kExRjctdUnfilteredClstrTerm-single-euclidean.dat
eciQc3pt1batches01-05ClstrTerm-single-euclidean.dat
eciQc3pt1batches01-05Top5PerTerm-LocnVectorsClstrLocn-single-euclidean.dat
eciQc3pt1batches01-05Top5PerTerm-TermVectorsClstrTerm-single-euclidean.dat

The current data

Below are details of the currently loaded data set:
The largest merge height (dissimilarity value) is 0.
There are 0 leaf nodes.

Dat file name=eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.dat

It contains the following data file names and description text:
Line No  File name
1  eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.node.csv
2  eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.leaf.csv
3  Data from :eciQc3pt1FullE2kExRjctdBalByScrpTermTop11  Linkage :single  Metric :euclidean
4  http://www.macs.hw.ac.uk/~dar14/fb3/eci_pics_db_manager.php?action=find_by_exptNoCsList&showExptNos=
5

Node file name=eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.node.csv

Line No  col1  col2  col3
1  116  117  0.12247449
2  18  21  0.12927499
3  16  17  0.14142136
4  20  206  0.15
5  3  9  0.16583124
6  19  207  0.16583124
7  2  11  0.16583124
8  22  205  0.17021723
9  24  31  0.17095112
10  208  210  0.17320508
11  26  212  0.17625687
12  12  209  0.17901541
13  14  215  0.18027756
14  13  216  0.19370761
15  25  214  0.19402455
16  27  218  0.2
17  155  158  0.2
18  10  213  0.2
19  89  90  0.2
20  211  217  0.20203045
21  136  137  0.21213203
22  23  219  0.21213203
23  93  222  0.21794495
24  29  225  0.21794495
25  118  119  0.2236068
26  15  223  0.2236068
27  113  204  0.2236068
28  92  226  0.2236068
29  164  167  0.22392359
30  7  221  0.22715315
31  5  233  0.22912878
32  32  227  0.22912878
33  193  197  0.22912878
34  151  152  0.22912878
35  56  58  0.22912878
36  34  37  0.22912878
37  114  230  0.23452079
38  68  70  0.23452079
39  33  235  0.23979158
40  156  220  0.23979158
41  112  240  0.23979158
42  75  241  0.23979158
43  159  243  0.23979158
44  228  244  0.24494897
45  4  234  0.24494897
46  154  246  0.24494897
47  109  110  0.24494897
48  1  248  0.24722932
49  182  186  0.25
50  173  177  0.25
51  149  237  0.25
52  247  250  0.25
53  39  239  0.25
54  199  236  0.25495098
55  122  124  0.25495098
56  91  231  0.25495098
57  201  202  0.25495098
58  184  252  0.25495098
59  160  232  0.25615969
60  143  144  0.25980762
61  101  107  0.25980762
62  194  260  0.25980762
63  133  224  0.25980762
64  257  265  0.26457513
65  195  267  0.26457513
66  30  242  0.26457513
67  175  253  0.26457513
68  67  245  0.26457513
69  111  255  0.26457513
70  170  262  0.26863655
71  203  268  0.26925824
72  138  139  0.26925824
73  6  251  0.27330167
74  71  73  0.27386128
75  8  276  0.27386128
76  249  254  0.27386128
77  163  273  0.2771431
78  185  261  0.27838822
79  150  279  0.27838822
80  130  258  0.27838822
81  123  283  0.27838822
82  35  256  0.28284271
83  126  284  0.28284271
84  64  135  0.28722813
85  266  287  0.28722813
86  63  179  0.29154759
87  157  282  0.29154759
88  95  259  0.29154759
89  134  288  0.29580399
90  171  181  0.29580399
91  115  272  0.29580399
92  96  291  0.29580399
93  162  280  0.29644525
94  165  296  0.29804715
95  97  98  0.3
96  166  297  0.3
97  105  106  0.30413813
98  76  277  0.30413813
99  189  191  0.30413813
100  187  190  0.30413813
101  69  271  0.30413813
102  295  298  0.30413813
103  153  290  0.30413813
104  82  84  0.30413813
105  66  289  0.30413813
106  198  274  0.3082207
107  161  299  0.3082207
108  302  303  0.3082207
109  127  286  0.3082207
110  65  292  0.3082207
111  281  311  0.3122499
112  72  304  0.3122499
113  86  87  0.3122499
114  308  313  0.3122499
115  176  180  0.31376201
116  314  317  0.31622777
117  300  309  0.31622777
118  52  319  0.31622777
119  62  321  0.31622777
120  183  322  0.31622777
121  46  49  0.31622777
122  142  275  0.31622777
123  145  263  0.31866014
124  28  269  0.31995075
125  270  323  0.32015621
126  45  47  0.32015621
127  293  328  0.32015621
128  132  330  0.32015621
129  74  315  0.32015621
130  80  307  0.32015621
131  316  333  0.32015621
132  285  331  0.32403703
133  120  312  0.32403703
134  59  60  0.32403703
135  83  85  0.32787193
136  94  305  0.32787193
137  335  336  0.32787193
138  125  340  0.32787193
139  50  324  0.32787193
140  329  342  0.32787193
141  178  318  0.32985427
142  338  341  0.33166248
143  344  345  0.33166248
144  334  346  0.33166248
145  306  347  0.33166248
146  326  348  0.33166248
147  40  44  0.33166248
148  102  104  0.3340807
149  57  349  0.3354102
150  140  352  0.3354102
151  99  339  0.3354102
152  100  320  0.33846231
153  36  350  0.3391165
154  41  356  0.3391165
155  200  355  0.34278273
156  174  353  0.34278273
157  168  359  0.34278273
158  131  360  0.34278273
159  238  361  0.34278273
160  129  362  0.34354247
161  172  363  0.34641016
162  88  364  0.34641016
163  357  365  0.35355339
164  128  366  0.35355339
165  54  367  0.35355339
166  61  368  0.35707142
167  38  369  0.35707142
168  169  310  0.35707142
169  42  370  0.35707142
170  81  372  0.36055513
171  77  301  0.36400549
172  337  373  0.36742346
173  332  374  0.36742346
174  43  375  0.37080992
175  141  147  0.37416574
176  358  377  0.37749172
177  351  379  0.37749172
178  146  148  0.37749172
179  325  380  0.38078866
180  51  343  0.38078866
181  264  382  0.38405729
182  121  384  0.38405729
183  79  385  0.38729833
184  371  386  0.38729833
185  383  387  0.38729833
186  192  388  0.38729833
187  103  389  0.39370039
188  196  390  0.3968627
189  53  391  0.40311289
190  108  392  0.40678749
191  55  393  0.41833001
192  376  394  0.42426407
193  188  395  0.42426407
194  378  396  0.42720019
195  381  397  0.43011626
196  78  398  0.45276926
197  327  399  0.49497475
198  354  400  0.50990195
199  294  401  0.51961524
200  48  402  0.56789083
201  229  403  0.58309519
202  278  404  0.60827625

Leaf file name=eciQc3pt1FullE2kExRjctdBalByScrpTermTop11ClstrLocn-single-euclidean.leaf.csv

Line Nocol1col2
110246
210036
310042
410993
511026
611735
711048
810035
910330
1010386
1111016
1210234
1311694
1411209
1511620
1611759
1711778
1811867
1910075
2011008
2111470
2210056
2311988
2410982
2510461
2610528
2711036
2810690
2910590
3010953
3111311
3211667
3311944
3411725
3510142
3611757
3710572
3810077
3911193
4011292
4111666
4211916
4310074
4410894
4510455
4610025
4710701
4811551
4911082
5011593
5111519
5210712
5310087
5411833
5510487
5611906
5710367
5811520
5910265
6011885
6110974
6211736
6310676
6411351
6511998
6611962
6711114
6810930
6910851
7010649
7111446
7210916
7310308
7411424
7510253
7611606
7710227
7811507
7910159
8010663
8111923
8210430
8311610
8411658
8510648
8611352
8711588
8810542
8910417
9010551
9110168
9211013
9311533
9411157
9510031
9611472
9711734
9811743
9911012
10010342
10110175
10210091
10311367
10410546
10511038
10611254
10711529
10810353
10910570
11010722
11110996
11211541
11310252
11411589
11510574
11610700
11711717
11811957
11911707
12011072
12111063
12210451
12311932
12410201
12511325
12610052
12710255
12811366
12911851
13010336
13111517
13211783
13310762
13410431
13510757
13611535
13711360
13811273
13911510
14011196
14111714
14211828
14310545
14411030
14510055
14610108
14710654
14811374
14911569
15010939
15110984
15210581
15311890
15410322
15510591
15611339
15711583
15811826
15911870
16010817
16110314
16210640
16311151
16410413
16510843
16610344
16710155
16810543
16910875
17011158
17110072
17211131
17311000
17410475
17511584
17610836
17710598
17810610
17911671
18010622
18110877
18211044
18310109
18410011
18511581
18610440
18711624
18810497
18911585
19011724
19110453
19210977
19311405
19411138
19511272
19610786
19710862
19810016
19910139
20011361
20110305
20210315
20310459

Groups data will be displayed here.

Output will appear here when groups are formed using the green draggable threshold bar and the "Apply threshold" button, or the "Set no. of groups" toolbar.

Dendrogrammer v 1.0.5

The Dendrogrammer was created as one component in the analysis of multivariate data. In particular, it is intended to allow pre-processed data to be
  • classified into groups and then either
  • used in further analysis, or
  • used in some form of presentation about the data.
Current Features
  • Dendrogram construction
  • File read
  • File read now includes a manifest file which cues a clustering data file (node data) and a leaf description data file (leaf data).
  • File read is now integrated into the application page. Hard-coded data loads if no data files are found.
  • Window width optimisation
  • Group formation by adjustable threshold
  • Cluster click/tap reports into dialog
  • Group formation is reported in a dialog and in a separate tab, and that tab pulses to notify. The pulsing stops when the tab is clicked.
  • The threshold is a draggable element on the graph
  • The threshold can also be adjusted using toolbar buttons
  • The desired number of groups can be entered and it will set the threshold to fit
  • Leaf labels on the vertical axis. These are interactively styled in response to events.
  • Dialogs are now guaranteed to be under 75% of the window height, scrolling if needed and using a default or wider width as appropriate on the PC platform. Scrolling dialogs are not deployed on the tablet platform, as they are not suitable for tablets.
  • The group banding (and groups) can be removed with the "Remove" button
  • Summary dendrogram: for use with dendrograms that have a very large number of leaves. It effectively truncates the dendrogram from height 0 up to a level at which the structure can be discerned, so that the clusters can be interrogated without hundreds of leaves along the bottom.
  • The Descendant Nodes Dialog contains a search link. The URL for this is specified in the .dat file which loads the data. The search link passes the cluster's leaf terms to a search engine.
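As an illustration of the search link feature, here is a minimal sketch of how a link could be assembled from a base URL and a cluster's leaf terms. It is not the application's own code (which is JavaScript). The function name build_search_link, the comma separator, and the example base URL are all assumptions made for illustration.

```python
from urllib.parse import quote


def build_search_link(base_url, leaf_terms, separator=","):
    """Append a cluster's leaf terms to the base URL taken from the .dat file.

    Assumption: the terms are simply joined with a separator and appended
    to the end of the URL. Each term is percent-encoded so that spaces or
    punctuation in a leaf label cannot break the link.
    """
    encoded = [quote(str(term), safe="") for term in leaf_terms]
    return base_url + separator.join(encoded)


# Hypothetical base URL, standing in for the one stored in the .dat file.
link = build_search_link("http://example.org/search?q=", ["10246", "10036"])
print(link)
```

Percent-encoding each term separately keeps the separator itself readable in the final link.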
Documentation
The technologies deployed in the application include
  • JavaScript - The major part of the application is written in JavaScript
  • Raphael JavaScript library (by Dmitry Baranovskiy) - Used to make the dendrogram graphic elements. Raphael is truly amazing.
  • (Indirectly) SVG and VML - Raphael uses SVG or VML to do the drawing. Raphael switches seamlessly depending on the browser
  • jQuery - For some of the cross-browser Dynamic HTML
  • jQuery UI - For some of the user interface supporting the application
  • HTML5 - Although there is nothing especially HTML5 about it really.
  • PHP - the data files are loaded using PHP.
The Dendrogrammer was created
  • by David Allan Robb
  • for his MSc project at Heriot-Watt University
  • in summer 2011
Quick-start Guide

In a nutshell:
  • Drag the green bar (Threshold bar) to adjust the dissimilarity threshold for group formation. Then click the "Apply threshold" button to form groups.
  • "Remove" will get rid of the groups.
  • Or enter the desired number of groups in the "no. of groups" field and click the "Apply no. of groups" button.
  • To create a summary dendrogram, drag the threshold, then click "Truncate".
  • "Restore" redraws the original dendrogram.
  • Another way to adjust the threshold is to enter a number in the "Threshold value" field.
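To make the idea of forming groups at a threshold concrete, here is a minimal sketch in Python (not the application's JavaScript). It assumes the node data follows the numbering used by MATLAB's linkage output: leaves are numbered 1 to n, and row i of the node data creates cluster n + i. It also assumes a merge is applied when its height is less than or equal to the threshold; whether the application uses an inclusive comparison is not stated here. The function name form_groups is hypothetical.

```python
def form_groups(merges, n_leaves, threshold):
    """Group leaves by applying every merge whose height is <= threshold.

    merges: list of (col1, col2, height) rows from the node data.
    Assumption: leaves are numbered 1..n_leaves and row i (1-based)
    creates the cluster numbered n_leaves + i.
    """
    # parent[x] is the cluster that x currently belongs to (index 0 unused).
    parent = list(range(n_leaves + len(merges) + 1))

    def find(x):
        # Follow parent links to the top-most cluster containing x.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, (a, b, height) in enumerate(merges, start=1):
        if height <= threshold:
            new_id = n_leaves + i
            parent[find(a)] = new_id
            parent[find(b)] = new_id

    groups = {}
    for leaf in range(1, n_leaves + 1):
        groups.setdefault(find(leaf), []).append(leaf)
    return sorted(groups.values())


# Tiny made-up example with 4 leaves (not taken from the demo data).
example = [(1, 2, 0.1), (3, 4, 0.2), (5, 6, 0.5)]
print(form_groups(example, 4, 0.3))
```

Raising the threshold applies more merges and so yields fewer, larger groups.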
Help

Dendrogram tab - View and interact with the dendrogram made from the clustering data.

  • Click or tap a cluster (element on the graph). Then read about that cluster in the dialog that appears.
  • That dialog also contains a search link. The URL for this is specified in the .dat file which loads the data. The search link passes the cluster's leaf terms to a search engine. It should open in a browser pop-up window or browser tab.
  • Adjust the green threshold bar and click the button to form groups of leaves (data points)
  • See more detail about the groups you form under the "Groups" tab.
  • The threshold can be adjusted by click-and-drag on the green bar, by using the threshold toolbar buttons, or by entering a number in the "Threshold value" field.
  • If you are using a tablet computer, drag the green bar with your finger. If you can't drag the bar accurately, use the toolbar buttons to move the threshold. You may find that, after moving the threshold bar away from the maximum with a button, you get a better feel for how it drags with your finger.
  • If using the toolbar fine-adjust buttons (plus and minus), note that the amount of movement with minus is set slightly less than with plus. This allows even finer adjustment if you alternate between plus and minus around the point you are looking for.
  • To specify a given number of groups, enter the desired number in the field next to the "Apply no. of groups" button and click the button. The threshold will be chosen automatically for you.
  • To view a summary dendrogram (or truncated dendrogram), drag the threshold to the point below which you wish to discard the leaf detail. Then click the "Truncate" button. The resulting summary dendrogram will have everything below the threshold removed and will be reshaped to fit the previous space. The origin is at the dissimilarity height you chose with the threshold bar. Tapping or clicking a cluster will still report all the leaf detail as before, despite the leaves being invisible. Groups can be formed in the same manner as before.
  • The "Restore" button will restore the original full dendrogram and the leaves as they were when you first viewed the current data. Loading new data or the same data again will have the same effect, as will reloading the page with your browser.
  • Use Control and + or - to zoom in and out (or pinch on a tablet). Look in your browser's View menu for more zoom controls.
  • What equipment to use: The Dendrogrammer was designed to be cross-platform (it works on PCs with a mouse and on tablets with tap).
  • What browser to use: The Dendrogrammer was designed to be cross-browser. It is written in JavaScript (with a little PHP to load the data), so it works in pretty much any browser that has JavaScript enabled. However, in my experience with the latest browsers (as at July 2011), the JavaScript engines in Firefox (Gecko with TraceMonkey), Safari (WebKit with SquirrelFish), and Chrome (WebKit with V8) are significantly faster (by a factor of 2 or more) than the "JScript" engine in Internet Explorer 8 and below. IE9 32-bit is good, but IE9 64-bit is reputed to be slow. So for faster, smoother performance with this particular application you may wish to use Firefox, Chrome, or Safari.
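The automatic choice of threshold for a requested number of groups can be sketched as follows. This is one plausible approach under stated assumptions, not necessarily what the application does: with n leaves, applying the n - k lowest merges leaves k groups, so any threshold between the (n - k)th and (n - k + 1)th smallest merge heights works, and the sketch picks the midpoint. It also assumes that merges at or below the threshold are applied. The function name threshold_for_groups is hypothetical. Tied merge heights, which do occur in the demo data, can make an exact group count impossible, and the sketch reports that rather than guessing.

```python
def threshold_for_groups(heights, k):
    """Return a threshold that yields k groups, or raise ValueError.

    heights: merge heights (third column of the node data), one per merge.
    Assumption: merges with height <= threshold are applied.
    """
    n_leaves = len(heights) + 1
    if not 1 <= k <= n_leaves:
        raise ValueError("k must be between 1 and the number of leaves")
    ordered = sorted(heights)
    if k == 1:
        # Every merge must be applied, so use the largest height.
        return ordered[-1]
    if k == n_leaves:
        # No merge may be applied, so go below the smallest height.
        if ordered[0] <= 0:
            raise ValueError("no threshold lies below the smallest merge")
        return ordered[0] / 2
    lower = ordered[n_leaves - k - 1]  # last merge that must be applied
    upper = ordered[n_leaves - k]      # first merge that must not be applied
    if lower == upper:
        raise ValueError("tied merge heights: exactly k groups is not possible")
    return (lower + upper) / 2


# Tiny made-up example with 4 leaves (not taken from the demo data).
print(threshold_for_groups([0.1, 0.2, 0.5], 2))
```

Choosing the midpoint keeps the threshold bar visibly clear of both neighbouring merges. Any value in the half-open interval from lower up to upper would give the same groups.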

Data tab - Load different data or find out about the current data used to make the dendrogram.

  • A table of links is shown representing all the available data sets. Clicking a link loads that data set and makes a dendrogram from it.
  • The currently loaded data set is described and displayed in 3 tables:
  • One table shows the contents of the .dat (or manifest) file. This contains meta data about the data set: the names of the two data files and a description of the data set.
  • The next table shows the contents of the clustering data file (or node file). This should be a csv file generated by MATLAB. It should have 3 columns. The first two columns are cluster or leaf designation numbers. The third column contains the merge height (dissimilarity value) for that cluster.
  • The last table shows the contents of the leaf description data file (or leaf file). This is a csv file. The first row describes leaf number one. The second row describes leaf number two and so on. It should contain two columns. Column one should hold the short leaf descriptions or labels. Column 2 should contain a longer description for each leaf (this can be a repeat of the short label).
  • Loading a new data set or the same data set again will result in a page refresh and will clear any groups or summary dendrogram displayed on the Dendrogram tab.
  • Adding further data sets can only be done by someone with write-access to the web space where the application resides (See 'Documentation' in the About tab.)
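For anyone preparing new data files, the node and leaf file layouts described above can be checked with a short script before uploading. The sketch below is in Python and is independent of the application. The function names are hypothetical. The consistency check assumes the numbering used by MATLAB's linkage output (leaves numbered 1 to n, row i creating cluster n + i), which is an assumption rather than something this page states.

```python
import csv
import io


def parse_node_csv(text):
    """Read node rows as (col1, col2, merge_height) tuples."""
    merges = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # skip blank lines
        # int(float(...)) also accepts values written as e.g. "116.0".
        merges.append((int(float(row[0])), int(float(row[1])), float(row[2])))
    return merges


def parse_leaf_csv(text):
    """Read leaf rows as (short_label, long_description) tuples."""
    leaves = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue
        short = row[0].strip()
        # Fall back to the short label if the long description is missing.
        long_text = row[1].strip() if len(row) > 1 and row[1].strip() else short
        leaves.append((short, long_text))
    return leaves


def check_consistency(merges, leaves):
    """Raise ValueError if the node rows cannot describe these leaves."""
    n = len(leaves)
    if len(merges) != n - 1:
        raise ValueError("expected one fewer node rows than leaves")
    for i, (a, b, _height) in enumerate(merges, start=1):
        for child in (a, b):
            # A row may only refer to leaves or to clusters made by earlier rows.
            if not 1 <= child < n + i:
                raise ValueError(f"row {i} refers to unknown node {child}")


# Tiny made-up files (not taken from the demo data).
node_text = "1,2,0.1\n3,4,0.2\n5,6,0.5\n"
leaf_text = "A,alpha\nB,beta\nC,\nD,delta\n"
merges = parse_node_csv(node_text)
leaves = parse_leaf_csv(leaf_text)
check_consistency(merges, leaves)
print(merges, leaves)
```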

Groups tab - View the detail of the groups formed using the threshold adjuster and toolbar.

  • The detail can be copied and pasted into other software for further analysis or presentation.

About tab - Read about the application.

Help tab - This page you are reading now is the Help.

Zoom Help

If you find the leaf labels too small you may wish to zoom in. If you find your screen height too small to fit a large dendrogram you may wish to zoom out.

How to zoom on a PC or laptop

  • On most browsers Control and + will zoom in.
  • Control and - will zoom out.
  • Control and mouse wheel works too on most browsers.
  • Look in your browser's View menu for more zoom commands. For example, some browsers offer a separate zoom for text only. If using that, bear in mind that your browser sees the entire dendrogram, including leaf labels, as graphics.
  • Once you have zoomed, refreshing the page will adjust the dendrogram to fit the width you have chosen.

On a tablet device

  • Use the open or close pinch gesture to zoom.
  • Once you have zoomed in, as you would expect, refreshing the page does not affect the dendrogram size. It remains zoomed until you zoom back.

Summary Dendrogram feature

  • By setting the threshold bar and clicking "Truncate" you can reduce the amount of leaf detail displayed and reduce the height of the dendrogram.
  • Clicking a cluster on a summary dendrogram still reveals all of its descendant leaf detail in the dialogs. Group formation works as normal.
  • The "Restore" button restores the normal dendrogram.
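The truncation behind a summary dendrogram can be sketched as below. This is a Python illustration under assumptions, not the application's JavaScript: leaves are numbered 1 to n, row i of the node data creates cluster n + i, merge heights never decrease from one row to the next, and merges at or below the threshold are the ones hidden. The function name truncate is hypothetical. Each hidden cluster becomes a single summary leaf that still remembers its original leaves, which is why clicking a cluster can still report all of its descendant leaf detail.

```python
def truncate(merges, n_leaves, threshold):
    """Return (summary_leaves, kept_merges) for a summary dendrogram.

    summary_leaves maps a node number to the original leaves beneath it.
    kept_merges are the rows with height > threshold, still to be drawn.
    Assumption: leaves are 1..n_leaves, row i (1-based) creates cluster
    n_leaves + i, and heights are non-decreasing down the rows.
    """
    members = {leaf: [leaf] for leaf in range(1, n_leaves + 1)}
    summary_leaves = {}
    kept_merges = []
    for i, (a, b, height) in enumerate(merges, start=1):
        node = n_leaves + i
        members[node] = sorted(members[a] + members[b])
        if height > threshold:
            kept_merges.append((a, b, height))
            for child in (a, b):
                # A child is drawn as a leaf if it is an original leaf or a
                # cluster that was merged at or below the threshold.
                is_leaf = child <= n_leaves
                if is_leaf or merges[child - n_leaves - 1][2] <= threshold:
                    summary_leaves[child] = members[child]
    if not kept_merges:
        # Everything lies at or below the threshold: one summary leaf remains.
        root = n_leaves + len(merges)
        summary_leaves[root] = members[root]
    return summary_leaves, kept_merges


# Tiny made-up example with 4 leaves (not taken from the demo data).
example = [(1, 2, 0.1), (3, 4, 0.2), (5, 6, 0.5)]
print(truncate(example, 4, 0.3))
```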