WELCOME TO THE NORMALIZER DBSchema::Normalizer is a Perl module to help you through the process of transforming a MySQL database table from 1st to 2nd (and eventually 3rd) normal form. It is a tool for the database designer, who is supposed to know what it has to be done. The Normalizer does not give you support for deciding which tables need to be normalized and which fields to move across tables. What you get from DBSchema::Normalizer is a quick way of performing the operations you have identified as needed. You can also have a certain degree of flexibility, since the Normalizer can also simulate the operation instead of executing it, giving you plenty of information to decide whether you want to go on or not. DATABASE THEORY You won't find database theory here. There are plenty of other places where to get such information. (http://www.palslib.com/Fundamentals/Database_Design.html is one of them. A sort of crash course on normalization with an elegant example on how to face this problem from a practical side can be found at www.perlmonks.org/index.pl?node_id=129454) I assume that you know what needs to be done, and that this knowledge comes to you by either a regular course or long practice. Either way, lacking the room to spread around some base theory, I feel that, having some hands-on examples on normalization will help you gather some insight on this process, which is too often explained only from a designer point of view, leaving helpless the ones with practical needs. INSTALLATION The usual procedure. perl Makefile.PL make make install make test USAGE use DBSchema::Normalizer; my $norm = DBSchema::Normalizer->new (\%param_hash); $norm->do(); %param_hash must contain at least the information to connect successfully to a mysql database through the DBI. For example, the following two calls are equivalent. The first one assumes that you have a MySQL configuration file (.my.cnf) in your home directory. This syntax is Unix specific. For other operating systems, please refer to the MySQL manual. # using a configuration file my $norm = DBSchema::Normalizer->new ({ DSN => "DBI:mysql:music;host=localhost;" . "mysql_read_default_file=$ENV{HOME}/.my.cnf", src_table => "MP3", index_field => "album_id", lookup_fields => "artist,album,genre", lookup_table => "tmp_albums", dest_table => "songs", copy_indexes => 1, simulate => 1 }); # passing username and password explicitly my $norm = DBSchema::Normalizer->new ({ DSN => "DBI:mysql:music;host=localhost" username => "itsme", password => "secret", src_table => "MP3", index_field => "album_id", lookup_fields => "artist,album,genre", lookup_table => "tmp_albums", dest_table => "songs", copy_indexes => 1, simulate => 1 }); Examples of usage can be found in the DBSchema::Normalizer module itself perldoc DBSchema::Normalizer DISTRIBUTION LIST Changes Version log MANIFEST List of files in this package Makefile.PL (non) installation script Normalizer.pm The Module Itself README this file test.pl test dependencies for DBSchema::Normalizer docs/Normalizer.html documentation in HTML format docs/Normalizer.pod documentation in POD format docs/Normalizer.txt plain text documentation examples/mp3.mysql sample database script MySQL examples/test_normalizer.pl test with sample database DOCUMENTATION All the documentation is embedded into the module itself in POD format, which you can enjoy by typing perldoc DBSchema::Normalizer I decided to spare you some milliseconds of CPU usage (and waste some kilobytes of mass storage in the process) and therefore you also have the same documentation in the following formats: Normalizer.pod POD format, in case you have some CPU cycles to spare and want to produce your own docs. Normalizer.html Browsable version Normalizer.txt If you are using a limited resources connection, or if you just like it, there is a plain text version. And of course the best documentation is the code itself, which you are welcome to peek at. However, try to use only the documented subs and leave the private ones alone (the ones starting by "_"). Please refer to the normal documentation, under "Architecture," to have an idea of the terrible things that expect you if you don't follow my advice. DEPENDENCIES AND REQUIREMENTS Normalizer.pm is built with direct references to MySQL databases, which are accessed through DBI and DBD::mysql. Failing those dependencies, it won't work. You may get the modules from the CPAN, but to try the Normalizer you must have access to a MySQL database, either in your computer or over a network connection. Your account must have SELECT, INSERT, CREATE and DROP privileges in order to use the Normalizer. SETTING UP THE EXAMPLES DBSchema::Normalizer is distributed into a compressed package, containing, among other things, two files that will let you experience a close encounter with the practical side of Data Designing. mp3.mysql test_normalizer.pl mp3.mysql is a sample from a MP3 table. It contains non-normalized data referring to MP3 files. To install it, you need to have a valid account to a MySQL database (with SELECT, CREATE, DROP and INSERT privileges) and run, at your shell prompt $ mysql [-u username] [-p] [-h hostname] < mp3.mysql This command will create a database "music" (if it does not exists,) and a table "MP3" populated with a few dozen records. That done, if you have a MySQL configuration file with username and password in your home directory, you can run the test_normalizer.pl without further steps. If you don't, or if you aren't using "localhost", then you need to edit the script and enter your DSN. The unedited example will not perform any changes in your database. It will run in "Simulation mode." (refer to the documentation for details.) You can use its output for the purposes that suit you better. If you turn simulation mode off ({simulate => 0}) then the Normalizer will act on your behalf. Make sure to read the documentation and to understand what is behind it before venturing into something you may regret. Note: Since I don't want to disrupt existing data, the MP3 table is not created if it already exists. If this is the case, please edit both the SQL and the Perl scripts to adjust them to your needs. AUTHOR Normalizer.pm is copyright (C) Giuseppe Maxia 2001. All rights reserved. Released under the GPL (GNU General Public License) version 2, April 1991. My contact for bugs, comments, advice: gmax@karma.oltrelinux.com