| contents previous next |
Spreadsheets are fine for managing individual worksheets of data, but they are not so suitable when it comes to managing large numbers of worksheets, or complex connections between worksheets.
You can use a catalog as a place to store and structure tabular data from spreadsheets.
For example, you may have a number of spreadsheets like:-
data-jan-5.xls:-
name strain result quality notes r1 a100 56 ok r2 a100 72 med r3 a200 76 bad some noise r4 a200 30 ok r5 a200 22 ok
and strain-info.xls:-
strain sample-room freezer desc a100 100 G knockout of XYZ a200 76 H wild type
and store all your spreadsheets in a directory like:
strains/ strain-info.xls jan/ data-jan-5.xls data-jan-7.xls data-jan-8.xls data-jan-9.xls data-jan-10.xls
The problem then becomes - how to sift out data which is scattered across multiple spreadsheets? (e.g. to choose all the good quality data values for a particular strain). With the spreadsheet you have to resort to time consuming manual copy / paste of rows to extract the data you want.
If you copy the data into Catalyzer, however, you can do instant sorting and searching across all the data.
You could structure the catalog similarly to the file system: and then copy each spreadsheet into a different folder of the catalog.
![]() |
| Catalyzer viewing a single worksheet from a spreadsheet |
Once you have the data as a catalog, you are free to:
![]() |
| Combining data from multiple spreadsheets, and pulling out all the 'bad' quality rows |
Because catalogs are based around familiar notions of folders and subfolders, they are often easier to use than database applications, which typically need a programmer to set up.
You can set up a catalog to store the types of information you may have put in a database before Catalyzer was available.
There are a number of ways of automatically importing the data you have in a database into a catalog (see section importing data from a database).