[project @ 2005-10-17 10:47:10 by simonmar]
[ghc-hetmet.git] / docs / building / building.xml
1 <?xml version="1.0" encoding="iso-8859-1"?>
2 <!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
3    "http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd">
4
5 <article id="building-guide">
6
7 <articleinfo>
8
9 <title>Building the Glasgow Functional Programming Tools Suite</title>
10 <author><othername>The GHC Team</othername></author>
11 <address><email>glasgow-haskell-&lcub;users,bugs&rcub;@haskell.org</email></address>
12
13     <abstract>
14       <para>The Glasgow fptools suite is a collection of Functional
15       Programming related tools, including the Glasgow Haskell
16       Compiler (GHC).  The source code for the whole suite is kept in
17       a single CVS repository and shares a common build and
18       installation system.</para>
19
20       <para>This guide is intended for people who want to build or
21       modify programs from the Glasgow <literal>fptools</literal>
22       suite (as distinct from those who merely want to
23       <emphasis>run</emphasis> them). Installation instructions are
24       now provided in the user guide.</para>
25
26       <para>The bulk of this guide applies to building on Unix
27       systems; see <xref linkend="winbuild"/> for Windows notes.</para>
28     </abstract>
29
30 </articleinfo>
31
32
33   <sect1 id="sec-getting">
34     <title>Getting the sources</title>
35     
36     <para>You can get your hands on the <literal>fptools</literal>
37     in two ways:</para>
38
39     <variablelist>
40
41       <varlistentry>
42         <term><indexterm><primary>Source
43         distributions</primary></indexterm>Source distributions</term>
44         <listitem>
45           <para>You have a supported platform, but (a)&nbsp;you like
46           the warm fuzzy feeling of compiling things yourself;
47           (b)&nbsp;you want to build something ``extra&rdquo;&mdash;e.g., a
48           set of libraries with strictness-analysis turned off; or
49           (c)&nbsp;you want to hack on GHC yourself.</para>
50
51           <para>A source distribution contains complete sources for
52           one or more projects in the <literal>fptools</literal>
53           suite.  Not only that, but the more awkward
54           machine-independent steps are done for you.  For example, if
55           you don't have
56           <command>happy</command><indexterm><primary>happy</primary></indexterm>
57           you'll find it convenient that the source distribution
58           contains the result of running <command>happy</command> on
59           the parser specifications.  If you don't want to alter the
60           parser then this saves you having to find and install
61           <command>happy</command>. You will still need a working
62           version of GHC (version 5.x or later) on your machine in
63           order to compile (most of) the sources, however.</para>
64         </listitem>
65       </varlistentry>
66
67       <varlistentry>
68         <term>The CVS repository.<indexterm><primary>CVS repository</primary></indexterm></term>
69         <listitem>
70           <para>We make releases infrequently.  If you want more
71           up-to-the minute (but less tested) source code then you need
72           to get access to our CVS repository.</para>
73
74           <para>All the <literal>fptools</literal> source code is held
75           in a CVS repository. CVS is a pretty good source-code
76           control system, and best of all it works over the
77           network.</para>
78
79           <para>The repository holds source code only. It holds no
80           mechanically generated files at all.  So if you check out a
81           source tree from CVS you will need to install every utility
82           so that you can build all the derived files from
83           scratch.</para>
84
85           <para>More information about our CVS repository can be found
86           in <xref linkend="sec-cvs"/>.</para>
87         </listitem>
88       </varlistentry>
89     </variablelist>
90
91     <para>If you are going to do any building from sources (either
92     from a source distribution or the CVS repository) then you need to
93     read all of this manual in detail.</para>
94   </sect1>
95
96   <sect1 id="sec-cvs">
97     <title>Using the CVS repository</title>
98
99     <para>We use <ulink url="http://www.cvshome.org/">CVS</ulink> (Concurrent Version System) to keep track of our
100     sources for various software projects. CVS lets several people
101     work on the same software at the same time, allowing changes to be
102     checked in incrementally. </para>
103
104     <para>This section is a set of guidelines for how to use our CVS
105     repository, and will probably evolve in time. The main thing to
106     remember is that most mistakes can be undone, but if there's
107     anything you're not sure about feel free to bug the local CVS
108     meister (namely Jeff Lewis
109     <email>jlewis@galois.com</email>). </para>
110
111     <sect2 id="cvs-access">
112       <title>Getting access to the CVS Repository</title>
113
114       <para>You can access the repository in one of two ways:
115       read-only (<xref linkend="cvs-read-only"/>), or read-write (<xref
116       linkend="cvs-read-write"/>).</para>
117
118       <sect3 id="cvs-read-only">
119         <title>Remote Read-only CVS Access</title>
120
121         <para>Read-only access is available to anyone - there's no
122         need to ask us first.  With read-only CVS access you can do
123         anything except commit changes to the repository.  You can
124         make changes to your local tree, and still use CVS's merge
125         facility to keep your tree up to date, and you can generate
126         patches using 'cvs diff' in order to send to us for
127         inclusion. </para>
128
129         <para>To get read-only access to the repository:</para>
130
131         <orderedlist>
132           <listitem>
133             <para>Make sure that <application>cvs</application> is
134             installed on your machine.</para>
135           </listitem>
136           <listitem>
137             <para>Set your <literal>$CVSROOT</literal> environment variable to
138             <literal>:pserver:anoncvs@cvs.haskell.org:/cvs</literal></para>
139             <para>If you set <literal>$CVSROOT</literal> in a shell script, be sure not to
140               have any trailing spaces on that line, otherwise CVS will respond with 
141               a perplexing message like
142               <screen>/cvs : no such repository</screen></para>
143           </listitem>
144           <listitem>
145             <para>Run the command</para>
146 <screen>$ cvs login</screen>
147             <para>The password is simply <literal>cvs</literal>.  This
148             sets up a file in your home directory called
149             <literal>.cvspass</literal>, which squirrels away the
150             dummy password, so you only need to do this step once.</para>
151           </listitem>
152
153           <listitem>
154             <para>Now go to <xref linkend="cvs-first"/>.</para>
155           </listitem>
156         </orderedlist>
157       </sect3>
158
159       <sect3 id="cvs-read-write">
160         <title>Remote Read-Write CVS Access</title>
161
162         <para>We generally supply read-write access to folk doing
163         serious development on some part of the source tree, when
164         going through us would be a pain. If you're developing some
165         feature, or think you have the time and inclination to fix
166         bugs in our sources, feel free to ask for read-write
167         access. There is a certain amount of responsibility that goes
168         with commit privileges; we are more likely to grant you access
169         if you've demonstrated your competence by sending us patches
170         via mail in the past.</para>
171
172         <para>To get remote read-write CVS access, you need to do the
173         following steps.</para>
174
175         <orderedlist>
176           <listitem>
177             <para>Make sure that <literal>cvs</literal> and
178             <literal>ssh</literal> are both installed on your
179             machine.</para>
180           </listitem>
181
182           <listitem>
183             <para>Generate a DSA private-key/public-key pair, thus:</para>
184 <screen>$ ssh-keygen -d</screen>
185             <para>(<literal>ssh-keygen</literal> comes with
186             <literal>ssh</literal>.)  Running <literal>ssh-keygen
187             -d</literal> creates the private and public keys in
188             <literal>$HOME/.ssh/id_dsa</literal> and
189             <literal>$HOME/.ssh/id_dsa.pub</literal> respectively
190             (assuming you accept the standard defaults).</para>
191
192             <para><literal>ssh-keygen -d</literal> will only work if
193             you have Version 2 <literal>ssh</literal> installed; it
194             will fail harmlessly otherwise.  If you only have Version
195             1 you can instead generate an RSA key pair using plain</para>
196 <screen>$ ssh-keygen</screen>
197
198             <para>Doing so creates the private and public RSA keys in
199             <literal>$HOME/.ssh/identity</literal> and
200             <literal>$HOME/.ssh/identity.pub</literal>
201             respectively.</para>
202
203             <para>[Deprecated.]  Incidentally, you can force a Version
204             2 <literal>ssh</literal> to use the Version 1 protocol by
205             creating <literal>$HOME/config</literal> with the
206             following in it:</para>
207 <programlisting>BatchMode Yes
208
209 Host cvs.haskell.org
210 Protocol 1</programlisting>
211
212             <para>In both cases, <literal>ssh-keygen</literal> will
213             ask for a <firstterm>passphrase</firstterm>.  The
214             passphrase is a password that protects your private key.
215             In response to the 'Enter passphrase' question, you can
216             either:</para>
217             <itemizedlist>
218               <listitem>
219                 <para>[Recommended.]  Enter a passphrase, which you
220                 will quote each time you use CVS.
221                 <literal>ssh-agent</literal> makes this entirely
222                 un-tiresome.</para>
223               </listitem>
224               <listitem>
225                 <para>[Deprecated.] Just hit return (i.e. use an empty
226                 passphrase); then you won't need to quote the
227                 passphrase when using CVS.  The downside is that
228                 anyone who can see into your <literal>.ssh</literal>
229                 directory, and thereby get your private key, can mess
230                 up the repository.  So you must keep the
231                 <literal>.ssh</literal> directory with draconian
232                 no-access permissions.</para>
233               </listitem>
234             </itemizedlist>
235
236
237        <para>
238        <emphasis>Windows users: see the notes in <xref linkend="configure-ssh"/> about <command>ssh</command> wrinkles!</emphasis>
239          </para>
240
241
242           </listitem>
243
244           <listitem>
245             <para>Send a message to to the CVS repository
246             administrator (currently Jeff Lewis
247             <email>jeff@galois.com</email>), containing:</para>
248             <itemizedlist>
249               <listitem>
250                 <para>Your desired user-name.</para>
251               </listitem>
252               <listitem>
253                 <para>Your <literal>.ssh/id_dsa.pub</literal> (or
254                 <literal>.ssh/identity.pub</literal>).</para>
255               </listitem>
256             </itemizedlist>
257             <para>He will set up your account.</para>
258           </listitem>
259
260           <listitem>
261             <para>Set the following environment variables:</para>
262            <itemizedlist>
263            <listitem>
264            <para>
265            <constant>$HOME</constant>: points to your home directory.  This is where CVS
266            will look for its <filename>.cvsrc</filename> file.
267            </para>
268            </listitem>
269
270            <listitem>
271            <para>
272            <constant>$CVS_RSH</constant> to <filename>ssh</filename>
273            </para>
274            <para>[Windows users.] Setting your <literal>CVS_RSH</literal> to
275             <literal>ssh</literal> assumes that your CVS client
276             understands how to execute shell script
277             (&quot;#!&quot;s,really), which is what
278             <literal>ssh</literal> is. This may not be the case on
279             Win32 platforms, so in that case set <literal>CVS_RSH</literal> to
280             <literal>ssh1</literal>.</para>
281            </listitem>
282
283              <listitem>
284                 <para><literal>$CVSROOT</literal> to
285                 <literal>:ext:</literal><replaceable>your-username</replaceable>
286                 <literal>@cvs.haskell.org:/home/cvs/root</literal>
287                 where <replaceable>your-username</replaceable> is your user name on
288                 <literal>cvs.haskell.org</literal>.
289                 </para>
290         <para>The <literal>CVSROOT</literal> environment variable will
291         be recorded in the checked-out tree, so you don't need to set
292         this every time. </para>
293
294              </listitem>
295
296         <listitem>
297         <para>
298         <constant>$CVSEDITOR</constant>: <filename>bin/gnuclient.exe</filename> 
299         if you want to use an Emacs buffer for typing in those long commit messages.
300         </para>
301         </listitem>
302
303         <listitem>
304         <para>
305         <constant>$SHELL</constant>: To use bash as the shell in Emacs, you need to
306         set this to point to <filename>bash.exe</filename>.
307         </para>
308         </listitem>
309
310        </itemizedlist>
311
312
313           </listitem>
314
315           <listitem>
316           <para>
317           Put the following in <filename>$HOME/.cvsrc</filename>:
318           </para>
319           
320 <programlisting>checkout -P
321 release -d
322 update -P
323 diff -u</programlisting>
324           
325           <para>
326           These are the default options for the specified CVS commands,
327           and represent better defaults than the usual ones.  (Feel
328           free to change them.)
329           </para>
330           
331           <para>
332           [Windows users.]  Filenames starting with <filename>.</filename> were illegal in 
333           the 8.3 DOS filesystem, but that restriction should have
334           been lifted by now (i.e., you're using VFAT or later filesystems.) If
335           you're still having problems creating it, don't worry; <filename>.cvsrc</filename> is entirely
336           optional.
337           </para>
338           </listitem>
339
340         </orderedlist>
341
342
343         <para>[Experts.]  Once your account is set up, you can get
344         access from other machines without bothering Jeff, thus:</para>
345         <orderedlist>
346           <listitem>
347             <para>Generate a public/private key pair on the new
348             machine.</para>
349           </listitem>
350           <listitem>
351             <para>Use ssh to log in to
352             <literal>cvs.haskell.org</literal>, from your old
353             machine.</para>
354           </listitem>
355           <listitem>
356             <para>Add the public key for the new machine to the file
357             <literal>$HOME/ssh/authorized_keys</literal> on
358             <literal>cvs.haskell.org</literal>.
359             (<literal>authorized_keys2</literal>, I think, for Version
360             2 protocol.)</para>
361           </listitem>
362           <listitem>
363             <para>Make sure that the new version of
364             <literal>authorized_keys</literal> still has 600 file
365             permissions.</para>
366           </listitem>
367         </orderedlist>
368       </sect3>
369     </sect2>
370
371
372
373     <sect2 id="cvs-first">
374       <title>Checking Out a Source Tree</title>
375
376       <itemizedlist>
377         <listitem>
378           <para>Make sure you set your <literal>CVSROOT</literal>
379           environment variable according to either of the remote
380           methods above. The Approved Way to check out a source tree
381           is as follows:</para>
382
383 <screen>$ cvs checkout fpconfig</screen>
384
385           <para>At this point you have a new directory called
386           <literal>fptools</literal> which contains the basic stuff
387           for the fptools suite, including the configuration files and
388           some other junk. </para>
389
390 <para>[Windows users.]  The following messages appear to be harmless:
391 <screen>setsockopt IPTOS_LOWDELAY: Invalid argument
392 setsockopt IPTOS_THROUGHPUT: Invalid argument</screen>
393 </para>
394
395
396           <para>You can call the fptools directory whatever you like,
397           CVS won't mind: </para>
398           
399 <screen>$ mv fptools <replaceable>directory</replaceable></screen>
400
401           <para> NB: after you've read the CVS manual you might be
402           tempted to try</para>
403 <screen>$ cvs checkout -d <replaceable>directory</replaceable> fpconfig</screen>
404
405           <para>instead of checking out <literal>fpconfig</literal>
406           and then renaming it.  But this doesn't work, and will
407           result in checking out the entire repository instead of just
408           the <literal>fpconfig</literal> bit.</para>
409 <screen>$ cd <replaceable>directory</replaceable>
410 $ cvs checkout ghc libraries</screen>
411
412           <para>The second command here checks out the relevant
413           modules you want to work on. For a GHC build, for instance,
414           you need at least the <literal>ghc</literal>,
415           and <literal>libraries</literal>
416           modules (for a full list of the projects available, see
417           <xref linkend="projects"/>).</para>
418
419           <para>Remember that if you do not have
420           <literal>happy</literal> and/or <literal>Alex</literal>
421           installed, you need to check them out as well.</para>
422         </listitem>
423       </itemizedlist>
424     </sect2>
425
426     <sect2 id="cvs-committing">
427       <title>Committing Changes</title>
428
429       <para>This is only if you have read-write access to the
430       repository. For anoncvs users, CVS will issue a &quot;read-only
431       repository&quot; error if you try to commit changes.</para>
432
433       <itemizedlist>
434         <listitem>
435           <para>Build the software, if necessary. Unless you're just
436           working on documentation, you'll probably want to build the
437           software in order to test any changes you make.</para>
438         </listitem>
439
440         <listitem>
441           <para>Make changes. Preferably small ones first.</para>
442         </listitem>
443
444         <listitem>
445           <para>Test them. You can see exactly what changes you've
446           made by using the <literal>cvs diff</literal> command:</para>
447 <screen>$ cvs diff</screen>
448           <para>lists all the changes (using the
449           <literal>diff</literal> command) in and below the current
450           directory. In emacs, <literal>C-c C-v =</literal> runs
451           <literal>cvs diff</literal> on the current buffer and shows
452           you the results.</para>
453         </listitem>
454
455         <listitem>
456           <para>If you changed something in the 
457           <literal>fptools/libraries</literal> subdirectories, also run
458           <literal>make html</literal> to check if the documentation can
459           be generated successfully, too.</para>
460         </listitem>
461
462         <listitem>
463           <para>Before checking in a change, you need to update your
464           source tree:</para>
465
466 <screen>$ cd fptools
467 $ cvs update</screen>
468           <para>This pulls in any changes that other people have made,
469           and merges them with yours. If there are any conflicts, CVS
470           will tell you, and you'll have to resolve them before you
471           can check your changes in. The documentation describes what
472           to do in the event of a conflict.</para>
473
474           <para>It's not always necessary to do a full cvs update
475           before checking in a change, since CVS will always tell you
476           if you try to check in a file that someone else has changed.
477           However, you should still update at regular intervals to
478           avoid making changes that don't work in conjuction with
479           changes that someone else made. Keeping an eye on what goes
480           by on the mailing list can help here.</para>
481         </listitem>
482
483         <listitem>
484           <para>When you're happy that your change isn't going to
485           break anything, check it in. For a one-file change:</para>
486
487 <screen>$ cvs commit <replaceable>filename</replaceable></screen>
488
489           <para>CVS will then pop up an editor for you to enter a
490           &quot;commit message&quot;, this is just a short description
491           of what your change does, and will be kept in the history of
492           the file.</para>
493
494           <para>If you're using emacs, simply load up the file into a
495           buffer and type <literal>C-x C-q</literal>, and emacs will
496           prompt for a commit message and then check in the file for
497           you.</para>
498
499           <para>For a multiple-file change, things are a bit
500           trickier. There are several ways to do this, but this is the
501           way I find easiest. First type the commit message into a
502           temporary file. Then either</para>
503
504 <screen>$ cvs commit -F <replaceable>commit-message</replaceable> <replaceable>file_1</replaceable> .... <replaceable>file_n</replaceable></screen>
505
506           <para>or, if nothing else has changed in this part of the
507           source tree, </para>
508
509 <screen>$ cvs commit -F <replaceable>commit-message</replaceable> <replaceable>directory</replaceable></screen>
510
511           <para>where <replaceable>directory</replaceable> is a common
512           parent directory for all your changes, and
513           <replaceable>commit-message</replaceable> is the name of the
514           file containing the commit message.</para>
515
516           <para>Shortly afterwards, you'll get some mail from the
517           relevant mailing list saying which files changed, and giving
518           the commit message. For a multiple-file change, you should
519           still get only <emphasis>one</emphasis> message.</para>
520         </listitem>
521       </itemizedlist>
522     </sect2>
523
524     <sect2 id="cvs-update">
525       <title>Updating Your Source Tree</title>
526
527       <para>It can be tempting to cvs update just part of a source
528       tree to bring in some changes that someone else has made, or
529       before committing your own changes. This is NOT RECOMMENDED!
530       Quite often changes in one part of the tree are dependent on
531       changes in another part of the tree (the
532       <literal>mk/*.mk</literal> files are a good example where
533       problems crop up quite often). Having an inconsistent tree is a
534       major cause of headaches. </para>
535
536       <para>So, to avoid a lot of hassle, follow this recipe for
537       updating your tree:</para>
538
539 <screen>$ cd fptools
540 $ cvs update -P 2&gt;&amp;1 | tee log</screen>
541
542       <para>Look at the log file, and fix any conflicts (denoted by a
543       <quote>C</quote> in the first column).  New directories may have
544       appeared in the repository; CVS doesn't check these out by
545       default, so to get new directories you have to explicitly do
546 <screen>$ cvs update -d</screen>
547       in each project subdirectory.  Don't do this at the top level,
548       because then <emphasis>all</emphasis> the projects will be
549       checked out.</para>
550
551       <para>If you're using multiple build trees, then for every build
552       tree you have pointing at this source tree, you need to update
553       the links in case any new files have appeared: </para>
554
555 <screen>$ cd <replaceable>build-tree</replaceable>
556 $ lndir <replaceable>source-tree</replaceable></screen>
557
558       <para>Some files might have been removed, so you need to remove
559       the links pointing to these non-existent files:</para>
560
561 <screen>$ find . -xtype l -exec rm '{}' \;</screen>
562
563       <para>To be <emphasis>really</emphasis> safe, you should do
564       </para>
565
566 <screen>$ gmake all</screen>
567
568       <para>from the top-level, to update the dependencies and build
569       any changed files. </para>
570     </sect2>
571
572     <sect2 id="cvs-tags">
573       <title>GHC Tag Policy</title>
574
575       <para>If you want to check out a particular version of GHC,
576       you'll need to know how we tag versions in the repository.  The
577       policy (as of 4.04) is:</para>
578
579       <itemizedlist>
580         <listitem>
581           <para>The tree is branched before every major release.  The
582           branch tag is <literal>ghc-x-xx-branch</literal>, where
583           <literal>x-xx</literal> is the version number of the release
584           with the <literal>'.'</literal> replaced by a
585           <literal>'-'</literal>.  For example, the 4.04 release lives
586           on <literal>ghc-4-04-branch</literal>.</para>
587         </listitem>
588
589         <listitem>
590           <para>The release itself is tagged with
591           <literal>ghc-x-xx</literal> (on the branch).  eg. 4.06 is
592           called <literal>ghc-4-06</literal>.</para>
593         </listitem>
594
595         <listitem>
596           <para>We didn't always follow these guidelines, so to see
597           what tags there are for previous versions, do <literal>cvs
598           log</literal> on a file that's been around for a while (like
599           <literal>fptools/ghc/README</literal>).</para>
600         </listitem>
601       </itemizedlist>
602
603       <para>So, to check out a fresh GHC 4.06 tree you would
604       do:</para>
605
606 <screen>$ cvs co -r ghc-4-06 fpconfig
607 $ cd fptools
608 $ cvs co -r ghc-4-06 ghc libraries</screen>
609     </sect2>
610
611     <sect2 id="cvs-hints">
612       <title>General Hints</title>
613
614       <itemizedlist>
615         <listitem>
616           <para>As a general rule: commit changes in small units,
617           preferably addressing one issue or implementing a single
618           feature.  Provide a descriptive log message so that the
619           repository records exactly which changes were required to
620           implement a given feature/fix a bug. I've found this
621           <emphasis>very</emphasis> useful in the past for finding out
622           when a particular bug was introduced: you can just wind back
623           the CVS tree until the bug disappears.</para>
624         </listitem>
625
626         <listitem>
627           <para>Keep the sources at least *buildable* at any given
628           time. No doubt bugs will creep in, but it's quite easy to
629           ensure that any change made at least leaves the tree in a
630           buildable state. We do nightly builds of GHC to keep an eye
631           on what things work/don't work each day and how we're doing
632           in relation to previous verions. This idea is truely wrecked
633           if the compiler won't build in the first place!</para>
634         </listitem>
635
636         <listitem>
637           <para>To check out extra bits into an already-checked-out
638           tree, use the following procedure.  Suppose you have a
639           checked-out fptools tree containing just ghc, and you want
640           to add nofib to it:</para>
641
642 <screen>$ cd fptools
643 $ cvs checkout nofib</screen>
644
645           <para>or: </para>
646
647 <screen>$ cd fptools
648 $ cvs update -d nofib</screen>
649           
650           <para>(the -d flag tells update to create a new
651           directory). If you just want part of the nofib suite, you
652           can do </para>
653
654 <screen>$ cd fptools
655 $ cvs checkout nofib/spectral</screen>
656
657           <para>This works because <literal>nofib</literal> is a
658           module in its own right, and spectral is a subdirectory of
659           the nofib module. The path argument to checkout must always
660           start with a module name. There's no equivalent form of this
661           command using <literal>update</literal>.</para>
662         </listitem>
663       </itemizedlist>
664     </sect2>
665   </sect1>
666
667   <sect1 id="projects">
668     <title>What projects are there?</title>
669
670     <para>The <literal>fptools</literal> suite consists of several
671     <firstterm>projects</firstterm>, most of which can be downloaded,
672     built and installed individually.  Each project corresponds to a
673     subdirectory in the source tree, and if checking out from CVS then
674     each project can be checked out individually by sitting in the top
675     level of your source tree and typing <command>cvs checkout
676     <replaceable>project</replaceable></command>.</para>
677
678     <para>Here is a list of the projects currently available:</para>
679
680     <variablelist>
681       <varlistentry>
682         <term>
683           <literal>alex</literal>
684           <indexterm><primary><literal>alex</literal></primary><secondary>project</secondary></indexterm>
685         </term>
686         <listitem>
687           <para>The <ulink
688           url="http://www.haskell.org/alex/">Alex</ulink> lexical
689           analyser generator for Haskell.</para>
690         </listitem>
691       </varlistentry>
692
693       <varlistentry>
694         <term>
695           <literal>ghc</literal>
696           <indexterm><primary><literal>ghc</literal></primary>
697           <secondary>project</secondary></indexterm>
698         </term>
699         <listitem>
700           <para>The <ulink url="http://www.haskell.org/ghc/">Glasgow
701           Haskell Compiler</ulink> (minus libraries).  Absolutely
702           required for building GHC.</para>
703         </listitem>
704       </varlistentry>
705
706       <varlistentry>
707         <term>
708           <literal>glafp-utils</literal>
709           <indexterm><primary><literal>glafp-utils</literal></primary><secondary>project</secondary></indexterm>
710         </term>
711         <listitem>
712           <para>Utility programs, some of which are used by the
713           build/installation system.  Required for pretty much
714           everything.</para>
715         </listitem>
716       </varlistentry>
717
718       <varlistentry>
719         <term>
720           <literal>greencard</literal>
721           <indexterm><primary><literal>greencard</literal></primary><secondary>project</secondary></indexterm>
722         </term>
723         <listitem>
724           <para>The <ulink
725           url="http://www.haskell.org/greencard/">GreenCard</ulink>
726           system for generating Haskell foreign function
727           interfaces.</para>
728         </listitem>
729       </varlistentry>
730
731       <varlistentry>
732         <term>
733           <literal>haggis</literal>
734           <indexterm><primary><literal>haggis</literal></primary><secondary>project</secondary></indexterm>
735         </term>
736         <listitem>
737           <para>The <ulink
738           url="http://www.dcs.gla.ac.uk/fp/software/haggis/">Haggis</ulink>
739           Haskell GUI framework.</para>
740         </listitem>
741       </varlistentry>
742
743       <varlistentry>
744         <term>
745           <literal>haddock</literal>
746           <indexterm><primary><literal>haddock</literal></primary><secondary>project</secondary></indexterm>
747         </term>
748         <listitem>
749           <para>The <ulink
750           url="http://www.haskell.org/haddock/">Haddock</ulink>
751           documentation tool.</para>
752         </listitem>
753       </varlistentry>
754
755       <varlistentry>
756         <term>
757           <literal>happy</literal>
758           <indexterm><primary><literal>happy</literal></primary><secondary>project</secondary></indexterm>
759         </term>
760         <listitem>
761           <para>The <ulink
762           url="http://www.haskell.org/happy/">Happy</ulink> Parser
763           generator.</para>
764         </listitem>
765       </varlistentry>
766
767       <varlistentry>
768         <term>
769           <literal>hdirect</literal>
770           <indexterm><primary><literal>hdirect</literal></primary><secondary>project</secondary></indexterm>
771         </term>
772         <listitem>
773           <para>The <ulink
774           url="http://www.haskell.org/hdirect/">H/Direct</ulink>
775           Haskell interoperability tool.</para>
776         </listitem>
777       </varlistentry>
778
779       <varlistentry>
780         <term>
781           <literal>hood</literal>
782           <indexterm><primary><literal>hood</literal></primary><secondary>project</secondary></indexterm>
783         </term>
784         <listitem>
785           <para>The <ulink url="http://www.haskell.org/hood/">Haskell
786           Object Observation Debugger</ulink>.</para>
787         </listitem>
788       </varlistentry>
789
790       <varlistentry>
791         <term>
792           <literal>hslibs</literal>
793           <indexterm><primary><literal>hslibs</literal></primary><secondary>project</secondary></indexterm>
794         </term>
795         <listitem>
796           <para>Old, now deprecated, libraries.  Everything in here is in <literal>libraries</literal>.</para>
797         </listitem>
798       </varlistentry>
799
800       <varlistentry>
801         <term>
802           <literal>libraries</literal>
803           <indexterm><primary><literal></literal></primary><secondary>project</secondary></indexterm>
804         </term>
805         <listitem>
806           <para>Hierarchical Haskell library suite
807           (<emphasis>required</emphasis> for building GHC).</para>
808         </listitem>
809       </varlistentry>
810
811       <varlistentry>
812         <term>
813           <literal>mhms</literal>
814           <indexterm><primary><literal></literal></primary><secondary>project</secondary></indexterm>
815         </term>
816         <listitem>
817           <para>The Modular Haskell Metric System.</para>
818         </listitem>
819       </varlistentry>
820
821       <varlistentry>
822         <term>
823           <literal>nofib</literal>
824           <indexterm><primary><literal>nofib</literal></primary><secondary>project</secondary></indexterm>
825         </term>
826         <listitem>
827           <para>The NoFib suite: A collection of Haskell programs used
828           primarily for benchmarking.</para>
829         </listitem>
830       </varlistentry>
831
832       <varlistentry>
833         <term>
834           <literal>testsuite</literal>
835           <indexterm><primary><literal>testsuite</literal></primary><secondary>project</secondary></indexterm>
836         </term>
837         <listitem>
838           <para>A testing framework, including GHC's regression test
839           suite.</para>
840         </listitem>
841       </varlistentry>
842     </variablelist>
843
844     <para>So, to build GHC you need at least the
845     <literal>ghc</literal> and <literal>libraries</literal>
846     projects (a GHC source distribution will
847     already include the bits you need).</para>
848   </sect1>
849
850   <sect1 id="sec-build-checks">
851     <title>Things to check before you start</title>
852
853     <para>Here's a list of things to check before you get
854     started.</para>
855
856     <orderedlist>
857
858       <listitem><para><indexterm><primary>Disk space needed</primary></indexterm>Disk
859         space needed: from about 100Mb for a basic GHC
860         build, up to probably 500Mb for a GHC build with everything
861         included (libraries built several different ways,
862         etc.).</para>
863       </listitem>
864
865       <listitem>
866         <para>Use an appropriate machine / operating system.  <xref
867         linkend="sec-port-info"/> lists the supported platforms; if
868         yours isn't amongst these then you can try porting GHC (see
869         <xref linkend="sec-porting-ghc"/>).</para>
870       </listitem>
871
872       <listitem>
873         <para>Be sure that the &ldquo;pre-supposed&rdquo; utilities are
874         installed.  <xref linkend="sec-pre-supposed"/>
875         elaborates.</para>
876       </listitem>
877
878       <listitem>
879         <para>If you have any problem when building or installing the
880         Glasgow tools, please check the &ldquo;known pitfalls&rdquo; (<xref
881         linkend="sec-build-pitfalls"/>).  Also check the FAQ for the
882         version you're building, which is part of the User's Guide and
883         available on the <ulink url="http://www.haskell.org/ghc/" >GHC web
884         site</ulink>.</para>
885
886         <indexterm><primary>bugs</primary><secondary>known</secondary></indexterm>
887
888         <para>If you feel there is still some shortcoming in our
889         procedure or instructions, please report it.</para>
890
891         <para>For GHC, please see the <ulink
892         url="http://www.haskell.org/ghc/docs/latest/set/bug-reporting.html">bug-reporting
893         section of the GHC Users' Guide</ulink>, to maximise the
894         usefulness of your report.</para>
895
896         <indexterm><primary>bugs</primary><secondary>seporting</secondary></indexterm>
897         <para>If in doubt, please send a message to
898         <email>glasgow-haskell-bugs@haskell.org</email>.
899         <indexterm><primary>bugs</primary><secondary>mailing
900         list</secondary></indexterm></para>
901       </listitem>
902     </orderedlist>
903   </sect1>
904
905   <sect1 id="sec-port-info">
906     <title>What machines the Glasgow tools run on</title>
907
908 <indexterm><primary>ports</primary><secondary>GHC</secondary></indexterm>
909 <indexterm><primary>GHC</primary><secondary>ports</secondary></indexterm>
910 <indexterm><primary>platforms</primary><secondary>supported</secondary></indexterm>
911
912     <para>The main question is whether or not the Haskell compiler
913     (GHC) runs on your platform.</para>
914
915     <para>A &ldquo;platform&rdquo; is a
916     architecture/manufacturer/operating-system combination, such as
917     <literal>sparc-sun-solaris2</literal>.  Other common ones are
918     <literal>alpha-dec-osf2</literal>,
919     <literal>hppa1.1-hp-hpux9</literal>,
920     <literal>i386-unknown-linux</literal>,
921     <literal>i386-unknown-solaris2</literal>,
922     <literal>i386-unknown-freebsd</literal>,
923     <literal>i386-unknown-cygwin32</literal>,
924     <literal>m68k-sun-sunos4</literal>,
925     <literal>mips-sgi-irix5</literal>,
926     <literal>sparc-sun-sunos4</literal>,
927     <literal>sparc-sun-solaris2</literal>,
928     <literal>powerpc-ibm-aix</literal>.</para>
929
930     <para>Some libraries may only work on a limited number of
931     platforms; for example, a sockets library is of no use unless the
932     operating system supports the underlying BSDisms.</para>
933
934     <sect2>
935       <title>What platforms the Haskell compiler (GHC) runs on</title>
936
937       <indexterm><primary>fully-supported platforms</primary></indexterm>
938       <indexterm><primary>native-code generator</primary></indexterm>
939       <indexterm><primary>registerised ports</primary></indexterm>
940       <indexterm><primary>unregisterised ports</primary></indexterm>
941
942       <para>The GHC hierarchy of Porting Goodness: (a)&nbsp;Best is a
943       native-code generator; (b)&nbsp;next best is a
944       &ldquo;registerised&rdquo; port; (c)&nbsp;the bare minimum is an
945       &ldquo;unregisterised&rdquo; port.
946       (&ldquo;Unregisterised&rdquo; is so terrible that we won't say
947       more about it).</para>
948
949       <para>We use Sparcs running Solaris 2.7 and x86 boxes running
950       FreeBSD and Linux, so those are the best supported platforms,
951       unsurprisingly.</para>
952
953       <para>Here's everything that's known about GHC ports.  We
954       identify platforms by their &ldquo;canonical&rdquo;
955       CPU/Manufacturer/OS triple.</para>
956
957       <variablelist>
958         <varlistentry>
959           <term>alpha-dec-{osf,linux,freebsd,openbsd,netbsd}:
960           <indexterm><primary>alpha-dec-osf</primary></indexterm>
961           <indexterm><primary>alpha-dec-linux</primary></indexterm>
962           <indexterm><primary>alpha-dec-freebsd</primary></indexterm>
963           <indexterm><primary>alpha-dec-openbsd</primary></indexterm>
964           <indexterm><primary>alpha-dec-netbsd</primary></indexterm>
965           </term>
966           <listitem>
967             <para>The OSF port is currently working (as of GHC version
968             5.02.1) and well supported.  The native code generator is
969             currently non-working.  Other operating systems will
970             require some minor porting.</para>
971           </listitem>
972         </varlistentry>
973
974         <varlistentry>
975           <term>sparc-sun-sunos4
976             <indexterm><primary>sparc-sun-sunos4</primary></indexterm>
977           </term>
978           <listitem>
979             <para>Probably works with minor tweaks, hasn't been tested
980             for a while.</para>
981           </listitem>
982         </varlistentry>
983
984         <varlistentry>
985           <term>sparc-sun-solaris2
986             <indexterm><primary>sparc-sun-solaris2</primary></indexterm>
987           </term>
988           <listitem>
989             <para>Fully supported (at least for Solaris 2.7 and 2.6),
990             including native-code generator.</para>
991           </listitem>
992         </varlistentry>
993
994         <varlistentry>
995           <term>sparc-unknown-openbsd
996             <indexterm><primary>sparc-unknown-openbsd</primary></indexterm>
997           </term>
998           <listitem>
999             <para>Supported, including native-code generator. The
1000             same should also be true of NetBSD</para>
1001           </listitem>
1002         </varlistentry>
1003
1004         <varlistentry>
1005           <term>hppa1.1-hp-hpux (HP-PA boxes running HPUX 9.x)
1006             <indexterm><primary>hppa1.1-hp-hpux</primary></indexterm>
1007           </term>
1008           <listitem>
1009             <para>A registerised port is available for version 4.08,
1010             but GHC hasn't been built on that platform since (as far
1011             as we know).  No native-code generator.</para>
1012           </listitem>
1013         </varlistentry>
1014
1015         <varlistentry>
1016           <term>i386-unknown-linux (PCs running Linux, ELF binary format)
1017             <indexterm><primary>i386-*-linux</primary></indexterm>
1018           </term>
1019           <listitem>
1020             <para>GHC works registerised and has a native code
1021             generator.  You <emphasis>must</emphasis> have GCC 2.7.x
1022             or later.  NOTE about <literal>glibc</literal> versions:
1023             GHC binaries built on a system running <literal>glibc
1024             2.0</literal> won't work on a system running
1025             <literal>glibc 2.1</literal>, and vice versa.  In general,
1026             don't expect compatibility between
1027             <literal>glibc</literal> versions, even if the shared
1028             library version hasn't changed.</para>
1029           </listitem>
1030         </varlistentry>
1031
1032         <varlistentry>
1033           <term>i386-unknown-freebsd (PCs running FreeBSD 2.2 or higher)
1034             <indexterm><primary>i386-unknown-freebsd</primary></indexterm>
1035           </term>
1036           <listitem>
1037             <para>GHC works registerised.  Pre-built packages are
1038             available in the native package format, so if you just
1039             need binaries you're better off just installing the
1040             package (it might even be on your installation
1041             CD!).</para>
1042           </listitem>
1043         </varlistentry>
1044
1045         <varlistentry>
1046           <term>i386-unknown-openbsd (PCs running OpenBSD)
1047             <indexterm><primary>i386-unknown-openbsd</primary></indexterm> 
1048           </term>
1049           <listitem>
1050             <para>Supported, with native code generator.  Packages are
1051             available through the ports system in the native package
1052             format.</para>
1053           </listitem>
1054         </varlistentry>
1055
1056         <varlistentry>
1057           <term>i386-unknown-netbsd (PCs running NetBSD)
1058             <indexterm><primary>i386-unknown-netbsd</primary></indexterm>
1059           </term>
1060           <listitem>
1061             <para>Will require some minor porting effort, but should
1062             work registerised.</para>
1063           </listitem>
1064         </varlistentry>
1065
1066         <varlistentry>
1067           <term>i386-unknown-mingw32 (PCs running Windows)
1068             <indexterm><primary>i386-unknown-mingw32</primary></indexterm>
1069           </term>
1070           <listitem>
1071             <para>Fully supported under Win9x, WinNT, Win2k, and
1072             WinXP.  Includes a native code generator.  Building from
1073             source requires a recent <ulink
1074             url="http://www.cygwin.com/">Cygwin</ulink> distribution
1075             to be installed.</para>
1076           </listitem>
1077         </varlistentry>
1078
1079         <varlistentry>
1080           <term>ia64-unknown-linux
1081             <indexterm><primary>ia64-unknown-linux</primary></indexterm>
1082           </term>
1083           <listitem>
1084             <para>Supported, except there is no native code
1085             generator.</para>
1086           </listitem>
1087         </varlistentry>
1088
1089         <varlistentry>
1090           <term>x86_64-unknown-linux
1091             <indexterm><primary>x86_64-unknown-linux</primary></indexterm>
1092           </term>
1093           <listitem>
1094             <para>GHC currently works unregisterised.  A registerised
1095             port is in progress.</para>
1096           </listitem>
1097         </varlistentry>
1098
1099         <varlistentry>
1100           <term>amd64-unknown-openbsd
1101             <indexterm><primary>amd64-unknown-linux</primary></indexterm>
1102           </term>
1103           <listitem>
1104             <para>(This is the same as x86_64-unknown-openbsd). GHC
1105                 currently works unregisterised.  A registerised port is in
1106                 progress.</para>
1107           </listitem>
1108         </varlistentry>
1109
1110         <varlistentry>
1111           <term>mips-sgi-irix5
1112             <indexterm><primary>mips-sgi-irix[5-6]</primary></indexterm>
1113           </term>
1114           <listitem>
1115             <para>Port has worked in the past, but hasn't been tested
1116             for some time (and will certainly have rotted in various
1117             ways).  As usual, we don't have access to machines and
1118             there hasn't been an overwhelming demand for this port,
1119             but feel free to get in touch.</para>
1120           </listitem>
1121         </varlistentry>
1122
1123         <varlistentry>
1124           <term>mips64-sgi-irix6
1125             <indexterm><primary>mips-sgi-irix6</primary></indexterm>
1126           </term>
1127           <listitem>
1128             <para>GHC currently works unregisterised.</para>
1129           </listitem>
1130         </varlistentry>
1131
1132         <varlistentry>
1133           <term>powerpc-ibm-aix
1134             <indexterm><primary>powerpc-ibm-aix</primary></indexterm>
1135           </term>
1136           <listitem>
1137             <para>Port currently doesn't work, needs some minimal
1138             porting effort.  As usual, we don't have access to
1139             machines and there hasn't been an overwhelming demand for
1140             this port, but feel free to get in touch.</para>
1141           </listitem>
1142         </varlistentry>
1143
1144         <varlistentry>
1145           <term>powerpc-apple-darwin
1146             <indexterm><primary>powerpc-apple-darwin</primary></indexterm> 
1147           </term>
1148           <listitem>
1149             <para>Supported registerised.  Native code generator is
1150             almost working.</para>
1151           </listitem>
1152         </varlistentry>
1153
1154         <varlistentry>
1155           <term>powerpc-apple-linux
1156             <indexterm><primary>powerpc-apple-linux</primary></indexterm> 
1157           </term>
1158           <listitem>
1159             <para>Not supported (yet).</para>
1160           </listitem>
1161         </varlistentry>
1162       </variablelist>
1163
1164       <para>Various other systems have had GHC ported to them in the
1165       distant past, including various Motorola 68k boxes.  The 68k
1166       support still remains, but porting to one of these systems will
1167       certainly be a non-trivial task.</para>
1168     </sect2>
1169
1170     <sect2>
1171       <title>What machines the other tools run on</title>
1172
1173       <para>Unless you hear otherwise, the other tools work if GHC
1174       works.</para>
1175     </sect2>
1176   </sect1>
1177
1178
1179   <sect1 id="sec-pre-supposed">
1180     <title>Installing pre-supposed utilities</title>
1181
1182     <indexterm><primary>pre-supposed utilities</primary></indexterm>
1183     <indexterm><primary>utilities, pre-supposed</primary></indexterm>
1184
1185     <para>Here are the gory details about some utility programs you
1186     may need; <command>perl</command>, <command>gcc</command> and
1187     <command>happy</command> are the only important
1188     ones. (PVM<indexterm><primary>PVM</primary></indexterm> is
1189     important if you're going for Parallel Haskell.)  The
1190     <command>configure</command><indexterm><primary>configure</primary></indexterm>
1191     script will tell you if you are missing something.</para>
1192
1193     <variablelist>
1194
1195       <varlistentry>
1196         <term>GHC
1197           <indexterm><primary>pre-supposed: GHC</primary></indexterm>
1198           <indexterm><primary>GHC, pre-supposed</primary></indexterm>
1199         </term>
1200         <listitem>
1201           <para>GHC is required to build many of the tools, including
1202           GHC itself.  If you need to port GHC to your platform
1203           because there isn't a binary distribution of GHC available,
1204           then see <xref linkend="sec-porting-ghc"/>.</para>
1205
1206           <para>Which version of GHC you need will depend on the
1207           packages you intend to build.  GHC itself will normally
1208           build using one of several older versions of itself - check
1209           the announcement or release notes for details.</para>
1210         </listitem>
1211       </varlistentry>
1212
1213       <varlistentry>
1214         <term>Perl
1215           <indexterm><primary>pre-supposed: Perl</primary></indexterm>
1216           <indexterm><primary>Perl, pre-supposed</primary></indexterm>
1217         </term>
1218         <listitem>
1219           <para><emphasis>You have to have Perl to proceed!</emphasis>
1220           Perl version 5 at least is required.  GHC has been known to
1221           tickle bugs in Perl, so if you find that Perl crashes when
1222           running GHC try updating (or downgrading) your Perl
1223           installation.  Versions of Perl before 5.6 have been known to have
1224           various bugs tickled by GHC, so the configure script
1225           will look for version 5.6 or later.</para>
1226
1227           <para>For Win32 platforms, you should use the binary
1228           supplied in the InstallShield (copy it to
1229           <filename>/bin</filename>).  The Cygwin-supplied Perl seems
1230           not to work.</para>
1231
1232           <para>Perl should be put somewhere so that it can be invoked
1233           by the <literal>&num;!</literal> script-invoking
1234           mechanism. The full pathname may need to be less than 32
1235           characters long on some systems.</para>
1236         </listitem>
1237       </varlistentry>
1238
1239       <varlistentry>
1240         <term>GNU C (<command>gcc</command>)
1241           <indexterm><primary>pre-supposed: GCC (GNU C compiler)</primary></indexterm>
1242           <indexterm><primary>GCC (GNU C compiler), pre-supposed</primary></indexterm>
1243         </term>
1244         <listitem>
1245           <para>We recommend using GCC version 2.95.2 on all
1246           platforms.  Failing that, version 2.7.2 is stable on most
1247           platforms.  Earlier versions of GCC can be assumed not to
1248           work, and versions in between 2.7.2 and 2.95.2 (including
1249           <command>egcs</command>) have varying degrees of stability
1250           depending on the platform.</para>
1251
1252           <para>GCC 3.2 is currently known to have problems building
1253           GHC on Sparc, but is stable on x86.</para>
1254           
1255           <para>If your GCC dies with &ldquo;internal error&rdquo; on
1256           some GHC source file, please let us know, so we can report
1257           it and get things improved.  (Exception: on x86
1258           boxes&mdash;you may need to fiddle with GHC's
1259           <option>-monly-N-regs</option> option; see the User's
1260           Guide)</para>
1261         </listitem>
1262       </varlistentry>
1263
1264       <varlistentry>
1265         <term>GNU Make
1266           <indexterm><primary>make</primary><secondary>GNU</secondary></indexterm>
1267         </term>
1268         <listitem>
1269           <para>The fptools build system makes heavy use of features
1270           specific to GNU <command>make</command>, so you must have
1271           this installed in order to build any of the fptools
1272           suite.</para>
1273         </listitem>
1274       </varlistentry>
1275
1276       <varlistentry>
1277         <term>Happy
1278           <indexterm><primary>Happy</primary></indexterm>
1279         </term>
1280         <listitem>
1281           <para>Happy is a parser generator tool for Haskell, and is
1282           used to generate GHC's parsers.  Happy is written in
1283           Haskell, and is a project in the CVS repository
1284           (<literal>fptools/happy</literal>).  It can be built from
1285           source, but bear in mind that you'll need GHC installed in
1286           order to build it.  To avoid the chicken/egg problem,
1287           install a binary distribution of either Happy or GHC to get
1288           started.  Happy distributions are available from <ulink
1289           url="http://www.haskell.org/happy/">Happy's Web
1290           Page</ulink>.</para>
1291         </listitem>
1292       </varlistentry>
1293
1294       <varlistentry>
1295         <term>Alex
1296           <indexterm><primary>Alex</primary></indexterm>
1297         </term>
1298         <listitem>
1299           <para>Alex is a lexical-analyser generator for Haskell,
1300           which GHC uses to generate its lexer.  Like Happy, Alex is
1301           written in Haskell and is a project in the CVS repository.
1302           Alex distributions are available from <ulink
1303           url="http://www.haskell.org/alex/">Alex's Web
1304           Page</ulink>.</para>
1305         </listitem>
1306       </varlistentry>
1307
1308       <varlistentry>
1309         <term>autoconf
1310           <indexterm><primary>pre-supposed: autoconf</primary></indexterm>
1311           <indexterm><primary>autoconf, pre-supposed</primary></indexterm>
1312         </term>
1313         <listitem>
1314           <para>GNU autoconf is needed if you intend to build from the
1315           CVS sources, it is <emphasis>not</emphasis> needed if you
1316           just intend to build a standard source distribution.</para>
1317
1318           <para>Version 2.52 or later of the autoconf package is required.
1319           NB. version 2.13 will no longer work, as of GHC version
1320           6.1.</para>
1321
1322           <para><command>autoreconf</command> (from the autoconf package)
1323           recursively builds <command>configure</command> scripts from
1324           the corresponding <filename>configure.ac</filename> and
1325           <filename>aclocal.m4</filename> files.  If you modify one of
1326           the latter files, you'll need <command>autoreconf</command> to
1327           rebuild the corresponding <filename>configure</filename>.</para>
1328         </listitem>
1329       </varlistentry>
1330
1331       <varlistentry>
1332         <term><command>sed</command>
1333           <indexterm><primary>pre-supposed: sed</primary></indexterm>
1334           <indexterm><primary>sed, pre-supposed</primary></indexterm>
1335         </term>
1336         <listitem>
1337           <para>You need a working <command>sed</command> if you are
1338           going to build from sources.  The build-configuration stuff
1339           needs it.  GNU sed version 2.0.4 is no good!  It has a bug
1340           in it that is tickled by the build-configuration.  2.0.5 is
1341           OK. Others are probably OK too (assuming we don't create too
1342           elaborate configure scripts.)</para>
1343         </listitem>
1344       </varlistentry>
1345     </variablelist>
1346
1347     <para>One <literal>fptools</literal> project is worth a quick note
1348     at this point, because it is useful for all the others:
1349     <literal>glafp-utils</literal> contains several utilities which
1350     aren't particularly Glasgow-ish, but Occasionally Indispensable.
1351     Like <command>lndir</command> for creating symbolic link
1352     trees.</para>
1353
1354     <sect2 id="pre-supposed-gph-tools">
1355       <title>Tools for building parallel GHC (GPH)</title>
1356
1357       <variablelist>
1358         <varlistentry>
1359           <term>PVM version 3:
1360           <indexterm><primary>pre-supposed: PVM3 (Parallel Virtual Machine)</primary></indexterm>
1361             <indexterm><primary>PVM3 (Parallel Virtual Machine), pre-supposed</primary></indexterm>
1362           </term>
1363           <listitem>
1364             <para>PVM is the Parallel Virtual Machine on which
1365             Parallel Haskell programs run.  (You only need this if you
1366             plan to run Parallel Haskell.  Concurrent Haskell, which
1367             runs concurrent threads on a uniprocessor doesn't need
1368             it.)  Underneath PVM, you can have (for example) a network
1369             of workstations (slow) or a multiprocessor box
1370             (faster).</para>
1371
1372             <para>The current version of PVM is 3.3.11; we use 3.3.7.
1373             It is readily available on the net; I think I got it from
1374             <literal>research.att.com</literal>, in
1375             <filename>netlib</filename>.</para>
1376
1377             <para>A PVM installation is slightly quirky, but easy to
1378             do.  Just follow the <filename>Readme</filename>
1379             instructions.</para>
1380           </listitem>
1381         </varlistentry>
1382
1383         <varlistentry>
1384           <term><command>bash</command>:
1385             <indexterm><primary>bash, presupposed (Parallel Haskell only)</primary></indexterm>
1386           </term>
1387           <listitem>
1388             <para>Sadly, the <command>gr2ps</command> script, used to
1389             convert &ldquo;parallelism profiles&rdquo; to PostScript,
1390             is written in Bash (GNU's Bourne Again shell).  This bug
1391             will be fixed (someday).</para>
1392           </listitem>
1393         </varlistentry>
1394       </variablelist>
1395     </sect2>
1396
1397     <sect2 id="pre-supposed-other-tools">
1398       <title>Other useful tools</title>
1399
1400       <variablelist>
1401         <varlistentry>
1402           <term>Flex
1403             <indexterm><primary>pre-supposed: flex</primary></indexterm> 
1404             <indexterm><primary>flex, pre-supposed</primary></indexterm>
1405           </term>
1406           <listitem>
1407             <para>This is a quite-a-bit-better-than-Lex lexer.  Used
1408             to build a couple of utilities in
1409             <literal>glafp-utils</literal>.  Depending on your
1410             operating system, the supplied <command>lex</command> may
1411             or may not work; you should get the GNU version.</para>
1412           </listitem>
1413         </varlistentry>
1414       </variablelist>
1415
1416       <para>More tools are required if you want to format the documentation
1417       that comes with GHC and other fptools projects.  See <xref
1418       linkend="building-docs"/>.</para>
1419     </sect2>
1420   </sect1>
1421
1422   <sect1 id="sec-building-from-source">
1423     <title>Building from source</title>
1424
1425     <indexterm><primary>Building from source</primary></indexterm>
1426     <indexterm><primary>Source, building from</primary></indexterm>
1427
1428     <para>You've been rash enough to want to build some of the Glasgow
1429     Functional Programming tools (GHC, Happy, nofib, etc.) from
1430     source.  You've slurped the source, from the CVS repository or
1431     from a source distribution, and now you're sitting looking at a
1432     huge mound of bits, wondering what to do next.</para>
1433
1434     <para>Gingerly, you type <command>make</command>.  Wrong
1435     already!</para>
1436
1437     <para>This rest of this guide is intended for duffers like me, who
1438     aren't really interested in Makefiles and systems configurations,
1439     but who need a mental model of the interlocking pieces so that
1440     they can make them work, extend them consistently when adding new
1441     software, and lay hands on them gently when they don't
1442     work.</para>
1443
1444     <sect2 id="quick-start">
1445       <title>Quick Start</title>
1446
1447       <para>If you are starting from a source distribution, and just
1448       want a completely standard build, then the following procedure should 
1449       work (unless you're on Windows, in which case go to <xref linkend="winbuild" />).</para>
1450
1451 <screen>$ autoreconf
1452 $ ./configure
1453 $ make
1454 $ make install</screen>
1455
1456       <para>For GHC, this will do a 2-stage bootstrap build of the
1457       compiler, with profiling libraries, and install the
1458       results.</para>
1459
1460       <para>If you want to do anything at all non-standard, or you
1461       want to do some development, read on...</para>
1462     </sect2>
1463
1464     <sect2 id="sec-source-tree">
1465       <title>Your source tree</title>
1466
1467       <para>The source code is held in your <emphasis>source
1468       tree</emphasis>.  The root directory of your source tree
1469       <emphasis>must</emphasis> contain the following directories and
1470       files:</para>
1471
1472       <itemizedlist>
1473         <listitem>
1474           <para><filename>Makefile</filename>: the root
1475           Makefile.</para>
1476         </listitem>
1477
1478         <listitem>
1479           <para><filename>mk/</filename>: the directory that contains
1480           the main Makefile code, shared by all the
1481           <literal>fptools</literal> software.</para>
1482         </listitem>
1483
1484         <listitem>
1485           <para><filename>configure.ac</filename>,
1486           <filename>config.sub</filename>,
1487           <filename>config.guess</filename>: these files support the
1488           configuration process.</para>
1489         </listitem>
1490
1491         <listitem>
1492           <para><filename>install-sh</filename>.</para>
1493         </listitem>
1494       </itemizedlist>
1495
1496       <para>All the other directories are individual
1497       <emphasis>projects</emphasis> of the <literal>fptools</literal>
1498       system&mdash;for example, the Glasgow Haskell Compiler
1499       (<literal>ghc</literal>), the Happy parser generator
1500       (<literal>happy</literal>), the <literal>nofib</literal>
1501       benchmark suite, and so on.  You can have zero or more of these.
1502       Needless to say, some of them are needed to build others.</para>
1503
1504       <para>The important thing to remember is that even if you want
1505       only one project (<literal>happy</literal>, say), you must have
1506       a source tree whose root directory contains
1507       <filename>Makefile</filename>, <filename>mk/</filename>,
1508       <filename>configure.ac</filename>, and the project(s) you want
1509       (<filename>happy/</filename> in this case).  You cannot get by
1510       with just the <filename>happy/</filename> directory.</para>
1511     </sect2>
1512
1513     <sect2>
1514       <title>Build trees</title>
1515       <indexterm><primary>build trees</primary></indexterm>
1516       <indexterm><primary>link trees, for building</primary></indexterm>
1517
1518       <para>If you just want to build the software once on a single
1519       platform, then your source tree can also be your build tree, and
1520       you can skip the rest of this section.</para>
1521
1522       <para>We often want to build multiple versions of our software
1523       for different architectures, or with different options
1524       (e.g. profiling).  It's very desirable to share a single copy of
1525       the source code among all these builds.</para>
1526
1527       <para>So for every source tree we have zero or more
1528       <emphasis>build trees</emphasis>.  Each build tree is initially
1529       an exact copy of the source tree, except that each file is a
1530       symbolic link to the source file, rather than being a copy of
1531       the source file.  There are &ldquo;standard&rdquo; Unix
1532       utilities that make such copies, so standard that they go by
1533       different names:
1534       <command>lndir</command><indexterm><primary>lndir</primary></indexterm>,
1535       <command>mkshadowdir</command><indexterm><primary>mkshadowdir</primary></indexterm>
1536       are two (If you don't have either, the source distribution
1537       includes sources for the X11
1538       <command>lndir</command>&mdash;check out
1539       <filename>fptools/glafp-utils/lndir</filename>). See <xref
1540       linkend="sec-storysofar"/> for a typical invocation.</para>
1541
1542       <para>The build tree does not need to be anywhere near the
1543       source tree in the file system.  Indeed, one advantage of
1544       separating the build tree from the source is that the build tree
1545       can be placed in a non-backed-up partition, saving your systems
1546       support people from backing up untold megabytes of
1547       easily-regenerated, and rapidly-changing, gubbins.  The golden
1548       rule is that (with a single exception&mdash;<xref
1549       linkend="sec-build-config"/>) <emphasis>absolutely everything in
1550       the build tree is either a symbolic link to the source tree, or
1551       else is mechanically generated</emphasis>.  It should be
1552       perfectly OK for your build tree to vanish overnight; an hour or
1553       two compiling and you're on the road again.</para>
1554
1555       <para>You need to be a bit careful, though, that any new files
1556       you create (if you do any development work) are in the source
1557       tree, not a build tree!</para>
1558
1559       <para>Remember, that the source files in the build tree are
1560       <emphasis>symbolic links</emphasis> to the files in the source
1561       tree.  (The build tree soon accumulates lots of built files like
1562       <filename>Foo.o</filename>, as well.)  You can
1563       <emphasis>delete</emphasis> a source file from the build tree
1564       without affecting the source tree (though it's an odd thing to
1565       do).  On the other hand, if you <emphasis>edit</emphasis> a
1566       source file from the build tree, you'll edit the source-tree
1567       file directly.  (You can set up Emacs so that if you edit a
1568       source file from the build tree, Emacs will silently create an
1569       edited copy of the source file in the build tree, leaving the
1570       source file unchanged; but the danger is that you think you've
1571       edited the source file whereas actually all you've done is edit
1572       the build-tree copy.  More commonly you do want to edit the
1573       source file.)</para>
1574
1575       <para>Like the source tree, the top level of your build tree
1576       must be (a linked copy of) the root directory of the
1577       <literal>fptools</literal> suite.  Inside Makefiles, the root of
1578       your build tree is called
1579       <constant>&dollar;(FPTOOLS&lowbar;TOP)</constant><indexterm><primary>FPTOOLS&lowbar;TOP</primary></indexterm>.
1580       In the rest of this document path names are relative to
1581       <constant>&dollar;(FPTOOLS&lowbar;TOP)</constant> unless
1582       otherwise stated.  For example, the file
1583       <filename>ghc/mk/target.mk</filename> is actually
1584       <filename>&dollar;(FPTOOLS&lowbar;TOP)/ghc/mk/target.mk</filename>.</para>
1585     </sect2>
1586
1587     <sect2 id="sec-build-config">
1588       <title>Getting the build you want</title>
1589
1590       <para>When you build <literal>fptools</literal> you will be
1591       compiling code on a particular <emphasis>host
1592       platform</emphasis>, to run on a particular <emphasis>target
1593       platform</emphasis> (usually the same as the host
1594       platform)<indexterm><primary>platform</primary></indexterm>.
1595       The difficulty is that there are minor differences between
1596       different platforms; minor, but enough that the code needs to be
1597       a bit different for each.  There are some big differences too:
1598       for a different architecture we need to build GHC with a
1599       different native-code generator.</para>
1600
1601       <para>There are also knobs you can turn to control how the
1602       <literal>fptools</literal> software is built.  For example, you
1603       might want to build GHC optimised (so that it runs fast) or
1604       unoptimised (so that you can compile it fast after you've
1605       modified it.  Or, you might want to compile it with debugging on
1606       (so that extra consistency-checking code gets included) or off.
1607       And so on.</para>
1608
1609       <para>All of this stuff is called the
1610       <emphasis>configuration</emphasis> of your build.  You set the
1611       configuration using a three-step process.</para>
1612
1613       <variablelist>
1614         <varlistentry>
1615           <term>Step 1: get ready for configuration.</term>
1616           <listitem>
1617             <para>NOTE: if you're starting from a source distribution,
1618             rather than CVS sources, you can skip this step.</para>
1619
1620             <para>Change directory to
1621             <constant>&dollar;(FPTOOLS&lowbar;TOP)</constant> and
1622             issue the command</para>
1623 <screen>$ autoreconf</screen>
1624             <indexterm><primary>autoreconf</primary></indexterm>
1625             <para>(with no arguments). This GNU program (recursively) converts
1626             <filename>&dollar;(FPTOOLS&lowbar;TOP)/configure.ac</filename> and
1627             <filename>&dollar;(FPTOOLS&lowbar;TOP)/aclocal.m4</filename>
1628             to a shell script called
1629             <filename>&dollar;(FPTOOLS&lowbar;TOP)/configure</filename>.
1630               If <command>autoreconf</command> bleats that it can't write the file <filename>configure</filename>,
1631               then delete the latter and try again.  Note that you must use <command>autoreconf</command>,
1632               and not the old <command>autoconf</command>!  If you erroneously use the latter, you'll get 
1633               a message like "No rule to make target 'mk/config.h.in'".
1634             </para>
1635
1636             <para>Some projects, including GHC, have their own configure script.
1637             <command>autoreconf</command> takes care of that, too, so all you have
1638              to do is calling <command>autoreconf</command> in the top-level directory
1639             <filename>&dollar;(FPTOOLS&lowbar;TOP)</filename>.</para>
1640
1641             <para>These steps are completely platform-independent; they just mean
1642             that the human-written files (<filename>configure.ac</filename> and
1643             <filename>aclocal.m4</filename>) can be short, although the resulting
1644             files (the <command>configure</command> shell scripts and the C header
1645             template <filename>mk/config.h.in</filename>) are long.</para>
1646           </listitem>
1647         </varlistentry>
1648
1649         <varlistentry>
1650           <term>Step 2: system configuration.</term>
1651           <listitem>
1652             <para>Runs the newly-created <command>configure</command>
1653             script, thus:</para>
1654
1655 <screen>$ ./configure <optional><parameter>args</parameter></optional></screen>
1656
1657             <para><command>configure</command>'s mission is to scurry
1658             round your computer working out what architecture it has,
1659             what operating system, whether it has the
1660             <function>vfork</function> system call, where
1661             <command>tar</command> is kept, whether
1662             <command>gcc</command> is available, where various obscure
1663             <literal>&num;include</literal> files are, whether it's a
1664             leap year, and what the systems manager had for lunch.  It
1665             communicates these snippets of information in two
1666             ways:</para>
1667
1668             <itemizedlist>
1669               <listitem>
1670                 
1671                 <para>It translates
1672                 <filename>mk/config.mk.in</filename><indexterm><primary>config.mk.in</primary></indexterm>
1673                 to
1674                 <filename>mk/config.mk</filename><indexterm><primary>config.mk</primary></indexterm>,
1675                 substituting for things between
1676                 &ldquo;<literal>@</literal>&rdquo; brackets.  So,
1677                 &ldquo;<literal>@HaveGcc@</literal>&rdquo; will be
1678                 replaced by &ldquo;<literal>YES</literal>&rdquo; or
1679                 &ldquo;<literal>NO</literal>&rdquo; depending on what
1680                 <command>configure</command> finds.
1681                 <filename>mk/config.mk</filename> is included by every
1682                 Makefile (directly or indirectly), so the
1683                 configuration information is thereby communicated to
1684                 all Makefiles.</para>
1685                 </listitem>
1686
1687               <listitem>
1688                 <para> It translates
1689                 <filename>mk/config.h.in</filename><indexterm><primary>config.h.in</primary></indexterm>
1690                 to
1691                 <filename>mk/config.h</filename><indexterm><primary>config.h</primary></indexterm>.
1692                 The latter is <literal>&num;include</literal>d by
1693                 various C programs, which can thereby make use of
1694                 configuration information.</para>
1695               </listitem>
1696             </itemizedlist>
1697
1698             <para><command>configure</command> takes some optional
1699             arguments.  Use <literal>./configure --help</literal> to
1700             get a list of the available arguments.  Here are some of
1701             the ones you might need:</para>
1702
1703             <variablelist>
1704               <varlistentry>
1705                 <term><literal>--with-ghc=<parameter>path</parameter></literal>
1706                   <indexterm><primary><literal>--with-ghc</literal></primary></indexterm>
1707                 </term>
1708                 <listitem>
1709                   <para>Specifies the path to an installed GHC which
1710                   you would like to use.  This compiler will be used
1711                   for compiling GHC-specific code (eg. GHC itself).
1712                   This option <emphasis>cannot</emphasis> be specified
1713                   using <filename>build.mk</filename> (see later),
1714                   because <command>configure</command> needs to
1715                   auto-detect the version of GHC you're using.  The
1716                   default is to look for a compiler named
1717                   <literal>ghc</literal> in your path.</para>
1718                 </listitem>
1719               </varlistentry>
1720               
1721               <varlistentry>
1722                 <term><literal>--with-hc=<parameter>path</parameter></literal>
1723                   <indexterm><primary><literal>--with-hc</literal></primary></indexterm>
1724                 </term>
1725                 <listitem>
1726                   <para>Specifies the path to any installed Haskell
1727                   compiler.  This compiler will be used for compiling
1728                   generic Haskell code.  The default is to use
1729                   <literal>ghc</literal>.</para>
1730                 </listitem>
1731               </varlistentry>
1732               
1733               <varlistentry>
1734                 <term><literal>--with-gcc=<parameter>path</parameter></literal>
1735                   <indexterm><primary><literal>--with-gcc</literal></primary></indexterm>
1736                 </term>
1737                 <listitem>
1738                   <para>Specifies the path to the installed GCC. This
1739                   compiler will be used to compile all C files,
1740                   <emphasis>except</emphasis> any generated by the
1741                   installed Haskell compiler, which will have its own
1742                   idea of which C compiler (if any) to use.  The
1743                   default is to use <literal>gcc</literal>.</para>
1744                 </listitem>
1745               </varlistentry>
1746             </variablelist>
1747           </listitem>
1748         </varlistentry>
1749         
1750         <varlistentry>
1751           <term>Step 3: build configuration.</term>
1752           <listitem>
1753             <para>Next, you say how this build of
1754             <literal>fptools</literal> is to differ from the standard
1755             defaults by creating a new file
1756             <filename>mk/build.mk</filename><indexterm><primary>build.mk</primary></indexterm>
1757             <emphasis>in the build tree</emphasis>.  This file is the
1758             one and only file you edit in the build tree, precisely
1759             because it says how this build differs from the source.
1760             (Just in case your build tree does die, you might want to
1761             keep a private directory of <filename>build.mk</filename>
1762             files, and use a symbolic link in each build tree to point
1763             to the appropriate one.)  So
1764             <filename>mk/build.mk</filename> never exists in the
1765             source tree&mdash;you create one in each build tree from
1766             the template.  We'll discuss what to put in it
1767             shortly.</para>
1768           </listitem>
1769         </varlistentry>
1770       </variablelist>
1771
1772       <para>And that's it for configuration. Simple, eh?</para>
1773
1774       <para>What do you put in your build-specific configuration file
1775       <filename>mk/build.mk</filename>?  <emphasis>For almost all
1776       purposes all you will do is put make variable definitions that
1777       override those in</emphasis>
1778       <filename>mk/config.mk.in</filename>.  The whole point of
1779       <filename>mk/config.mk.in</filename>&mdash;and its derived
1780       counterpart <filename>mk/config.mk</filename>&mdash;is to define
1781       the build configuration. It is heavily commented, as you will
1782       see if you look at it.  So generally, what you do is look at
1783       <filename>mk/config.mk.in</filename>, and add definitions in
1784       <filename>mk/build.mk</filename> that override any of the
1785       <filename>config.mk</filename> definitions that you want to
1786       change.  (The override occurs because the main boilerplate file,
1787       <filename>mk/boilerplate.mk</filename><indexterm><primary>boilerplate.mk</primary></indexterm>,
1788       includes <filename>build.mk</filename> after
1789       <filename>config.mk</filename>.)</para>
1790
1791      <para>For your convenience, there's a file called <filename>build.mk.sample</filename>
1792      that can serve as a starting point for your <filename>build.mk</filename>.</para>
1793
1794       <para>For example, <filename>config.mk.in</filename> contains
1795       the definition:</para>
1796
1797 <programlisting>GhcHcOpts=-O -Rghc-timing</programlisting>
1798
1799       <para>The accompanying comment explains that this is the list of
1800       flags passed to GHC when building GHC itself.  For doing
1801       development, it is wise to add <literal>-DDEBUG</literal>, to
1802       enable debugging code.  So you would add the following to
1803       <filename>build.mk</filename>:</para>
1804       
1805       <para>or, if you prefer,</para>
1806
1807 <programlisting>GhcHcOpts += -DDEBUG</programlisting>
1808
1809       <para>GNU <command>make</command> allows existing definitions to
1810       have new text appended using the &ldquo;<literal>+=</literal>&rdquo;
1811       operator, which is quite a convenient feature.)</para>
1812
1813       <para>If you want to remove the <literal>-O</literal> as well (a
1814       good idea when developing, because the turn-around cycle gets a
1815       lot quicker), you can just override
1816       <literal>GhcLibHcOpts</literal> altogether:</para>
1817
1818 <programlisting>GhcHcOpts=-DDEBUG -Rghc-timing</programlisting>
1819
1820       <para>When reading <filename>config.mk.in</filename>, remember
1821       that anything between &ldquo;@...@&rdquo; signs is going to be substituted
1822       by <command>configure</command> later.  You
1823       <emphasis>can</emphasis> override the resulting definition if
1824       you want, but you need to be a bit surer what you are doing.
1825       For example, there's a line that says:</para>
1826
1827 <programlisting>TAR = @TarCmd@</programlisting>
1828
1829       <para>This defines the Make variables <constant>TAR</constant>
1830       to the pathname for a <command>tar</command> that
1831       <command>configure</command> finds somewhere.  If you have your
1832       own pet <command>tar</command> you want to use instead, that's
1833       fine. Just add this line to <filename>mk/build.mk</filename>:</para>
1834
1835 <programlisting>TAR = mytar</programlisting>
1836
1837       <para>You do not <emphasis>have</emphasis> to have a
1838       <filename>mk/build.mk</filename> file at all; if you don't,
1839       you'll get all the default settings from
1840       <filename>mk/config.mk.in</filename>.</para>
1841
1842       <para>You can also use <filename>build.mk</filename> to override
1843       anything that <command>configure</command> got wrong.  One place
1844       where this happens often is with the definition of
1845       <constant>FPTOOLS&lowbar;TOP&lowbar;ABS</constant>: this
1846       variable is supposed to be the canonical path to the top of your
1847       source tree, but if your system uses an automounter then the
1848       correct directory is hard to find automatically.  If you find
1849       that <command>configure</command> has got it wrong, just put the
1850       correct definition in <filename>build.mk</filename>.</para>
1851
1852     </sect2>
1853
1854     <sect2 id="sec-storysofar">
1855       <title>The story so far</title>
1856
1857       <para>Let's summarise the steps you need to carry to get
1858       yourself a fully-configured build tree from scratch.</para>
1859
1860       <orderedlist>
1861         <listitem>
1862           <para> Get your source tree from somewhere (CVS repository
1863           or source distribution).  Say you call the root directory
1864           <filename>myfptools</filename> (it does not have to be
1865           called <filename>fptools</filename>).  Make sure that you
1866           have the essential files (see <xref
1867           linkend="sec-source-tree"/>).</para>
1868         </listitem>
1869
1870         <listitem>
1871
1872           <para>(Optional) Use <command>lndir</command> or
1873           <command>mkshadowdir</command> to create a build tree.</para>
1874
1875 <screen>$ cd myfptools
1876 $ mkshadowdir . /scratch/joe-bloggs/myfptools-sun4</screen>
1877
1878           <para>(N.B. <command>mkshadowdir</command>'s first argument
1879           is taken relative to its second.) You probably want to give
1880           the build tree a name that suggests its main defining
1881           characteristic (in your mind at least), in case you later
1882           add others.</para>
1883         </listitem>
1884
1885         <listitem>
1886           <para>Change directory to the build tree.  Everything is
1887           going to happen there now.</para>
1888
1889 <screen>$ cd /scratch/joe-bloggs/myfptools-sun4</screen>
1890
1891         </listitem>
1892
1893         <listitem>
1894           <para>Prepare for system configuration:</para>
1895
1896 <screen>$ autoreconf</screen>
1897
1898           <para>(You can skip this step if you are starting from a
1899           source distribution, and you already have
1900           <filename>configure</filename> and
1901           <filename>mk/config.h.in</filename>.)</para>
1902         </listitem>
1903
1904         <listitem>
1905           <para>Do system configuration:</para>
1906
1907 <screen>$ ./configure</screen>
1908
1909           <para>Don't forget to check whether you need to add any
1910           arguments to <literal>configure</literal>; for example, a
1911           common requirement is to specify which GHC to use with
1912           <option>--with-ghc=<replaceable>ghc</replaceable></option>.</para>
1913         </listitem>
1914
1915         <listitem>
1916           <para>Create the file <filename>mk/build.mk</filename>,
1917           adding definitions for your desired configuration
1918           options.</para>
1919
1920 <screen>$ emacs mk/build.mk</screen>
1921         </listitem>
1922       </orderedlist>
1923
1924       <para>You can make subsequent changes to
1925       <filename>mk/build.mk</filename> as often as you like.  You do
1926       not have to run any further configuration programs to make these
1927       changes take effect. In theory you should, however, say
1928       <command>gmake clean</command>, <command>gmake all</command>,
1929       because configuration option changes could affect
1930       anything&mdash;but in practice you are likely to know what's
1931       affected.</para>
1932     </sect2>
1933
1934     <sect2>
1935       <title>Making things</title>
1936
1937       <para>At this point you have made yourself a fully-configured
1938       build tree, so you are ready to start building real
1939       things.</para>
1940
1941       <para>The first thing you need to know is that <emphasis>you
1942       must use GNU <command>make</command>, usually called
1943       <command>gmake</command>, not standard Unix
1944       <command>make</command></emphasis>.  If you use standard Unix
1945       <command>make</command> you will get all sorts of error messages
1946       (but no damage) because the <literal>fptools</literal>
1947       <command>Makefiles</command> use GNU <command>make</command>'s
1948       facilities extensively.</para>
1949
1950       <para>To just build the whole thing, <command>cd</command> to
1951       the top of your <literal>fptools</literal> tree and type
1952       <command>gmake</command>.  This will prepare the tree and build
1953       the various projects in the correct order.</para>
1954     </sect2>
1955
1956     <sect2 id="sec-bootstrapping">
1957       <title>Bootstrapping GHC</title>
1958
1959       <para>GHC requires a 2-stage bootstrap in order to provide 
1960       full functionality, including GHCi.  By a 2-stage bootstrap, we
1961       mean that the compiler is built once using the installed GHC,
1962       and then again using the compiler built in the first stage.  You
1963       can also build a stage 3 compiler, but this normally isn't
1964       necessary except to verify that the stage 2 compiler is working
1965       properly.</para>
1966
1967       <para>Note that when doing a bootstrap, the stage 1 compiler
1968       must be built, followed by the runtime system and libraries, and
1969       then the stage 2 compiler.  The correct ordering is implemented
1970       by the top-level fptools <filename>Makefile</filename>, so if
1971       you want everything to work automatically it's best to start
1972       <command>make</command> from the top of the tree.  When building
1973       GHC, the top-level fptools <filename>Makefile</filename> is set
1974       up to do a 2-stage bootstrap by default (when you say
1975       <command>make</command>).  Some other targets it supports
1976       are:</para>
1977
1978       <variablelist>
1979         <varlistentry>
1980           <term>stage1</term>
1981           <listitem>
1982             <para>Build everything as normal, including the stage 1
1983             compiler.</para>
1984           </listitem>
1985         </varlistentry>
1986
1987         <varlistentry>
1988           <term>stage2</term>
1989           <listitem>
1990             <para>Build the stage 2 compiler only.</para>
1991           </listitem>
1992         </varlistentry>
1993
1994         <varlistentry>
1995           <term>stage3</term>
1996           <listitem>
1997             <para>Build the stage 3 compiler only.</para>
1998           </listitem>
1999         </varlistentry>
2000
2001         <varlistentry>
2002           <term>bootstrap</term> <term>bootstrap2</term>
2003           <listitem>
2004             <para>Build stage 1 followed by stage 2.</para>
2005           </listitem>
2006         </varlistentry>
2007
2008         <varlistentry>
2009           <term>bootstrap3</term>
2010           <listitem>
2011             <para>Build stages 1, 2 and 3.</para>
2012           </listitem>
2013         </varlistentry>
2014
2015         <varlistentry>
2016           <term>install</term>
2017           <listitem>
2018             <para>Install everything, including the compiler built in
2019             stage 2.  To override the stage, say <literal>make install
2020             stage=<replaceable>n</replaceable></literal> where
2021             <replaceable>n</replaceable> is the stage to install.</para>
2022           </listitem>
2023         </varlistentry>
2024       </variablelist>
2025
2026       <para>The top-level <filename>Makefile</filename> also arranges
2027       to do the appropriate <literal>make boot</literal> steps (see
2028       below) before actually building anything.</para>
2029
2030       <para>The <literal>stage1</literal>, <literal>stage2</literal>
2031       and <literal>stage3</literal> targets also work in the
2032       <literal>ghc/compiler</literal> directory, but don't forget that
2033       each stage requires its own <literal>make boot</literal> step:
2034       for example, you must do</para>
2035
2036       <screen>$ make boot stage=2</screen>
2037
2038       <para>before <literal>make stage2</literal> in
2039       <literal>ghc/compiler</literal>.</para>
2040     </sect2>
2041
2042     <sect2 id="sec-standard-targets">
2043       <title>Standard Targets</title>
2044       <indexterm><primary>targets, standard makefile</primary></indexterm>
2045       <indexterm><primary>makefile targets</primary></indexterm>
2046
2047       <para>In any directory you should be able to make the following:</para>
2048
2049       <variablelist>
2050         <varlistentry>
2051           <term><literal>boot</literal></term>
2052           <listitem>
2053             <para>does the one-off preparation required to get ready
2054             for the real work.  Notably, it does <command>gmake
2055             depend</command> in all directories that contain programs.
2056             It also builds the necessary tools for compilation to
2057             proceed.</para>
2058
2059             <para>Invoking the <literal>boot</literal> target
2060             explicitly is not normally necessary.  From the top-level
2061             <literal>fptools</literal> directory, invoking
2062             <literal>gmake</literal> causes <literal>gmake boot
2063             all</literal> to be invoked in each of the project
2064             subdirectories, in the order specified by
2065             <literal>&dollar;(AllTargets)</literal> in
2066             <literal>config.mk</literal>.</para>
2067
2068             <para>If you're working in a subdirectory somewhere and
2069             need to update the dependencies, <literal>gmake
2070             boot</literal> is a good way to do it.</para>
2071           </listitem>
2072         </varlistentry>
2073
2074         <varlistentry>
2075           <term><literal>all</literal></term>
2076           <listitem>
2077             <para>makes all the final target(s) for this Makefile.
2078             Depending on which directory you are in a &ldquo;final
2079             target&rdquo; may be an executable program, a library
2080             archive, a shell script, or a Postscript file.  Typing
2081             <command>gmake</command> alone is generally the same as
2082             typing <command>gmake all</command>.</para>
2083           </listitem>
2084         </varlistentry>
2085
2086         <varlistentry>
2087           <term><literal>install</literal></term>
2088           <listitem>
2089             <para>installs the things built by <literal>all</literal>
2090             (except for the documentation).  Where does it install
2091             them?  That is specified by
2092             <filename>mk/config.mk.in</filename>; you can override it
2093             in <filename>mk/build.mk</filename>, or by running
2094             <command>configure</command> with command-line arguments
2095             like <literal>--bindir=/home/simonpj/bin</literal>; see
2096             <literal>./configure --help</literal> for the full
2097             details.</para>
2098           </listitem>
2099         </varlistentry>
2100
2101         <varlistentry>
2102           <term><literal>install-docs</literal></term>
2103           <listitem>
2104             <para>installs the documentation. Otherwise behaves just
2105             like <literal>install</literal>.</para>
2106           </listitem>
2107         </varlistentry>
2108
2109         <varlistentry>
2110           <term><literal>uninstall</literal></term>
2111           <listitem>
2112             <para>reverses the effect of
2113             <literal>install</literal>.</para>
2114           </listitem>
2115         </varlistentry>
2116
2117         <varlistentry>
2118           <term><literal>clean</literal></term>
2119           <listitem>
2120             <para>Delete all files from the current directory that are
2121             normally created by building the program.  Don't delete
2122             the files that record the configuration, or files
2123             generated by <command>gmake boot</command>.  Also preserve
2124             files that could be made by building, but normally aren't
2125             because the distribution comes with them.</para>
2126           </listitem>
2127         </varlistentry>
2128
2129         <varlistentry>
2130           <term><literal>distclean</literal></term>
2131           <listitem>
2132             <para>Delete all files from the current directory that are
2133             created by configuring or building the program. If you
2134             have unpacked the source and built the program without
2135             creating any other files, <literal>make
2136             distclean</literal> should leave only the files that were
2137             in the distribution.</para>
2138           </listitem>
2139         </varlistentry>
2140
2141         <varlistentry>
2142           <term><literal>mostlyclean</literal></term>
2143           <listitem>
2144             <para>Like <literal>clean</literal>, but may refrain from
2145             deleting a few files that people normally don't want to
2146             recompile.</para>
2147           </listitem>
2148         </varlistentry>
2149
2150         <varlistentry>
2151           <term><literal>maintainer-clean</literal></term>
2152           <listitem>
2153             <para>Delete everything from the current directory that
2154             can be reconstructed with this Makefile.  This typically
2155             includes everything deleted by
2156             <literal>distclean</literal>, plus more: C source files
2157             produced by Bison, tags tables, Info files, and so
2158             on.</para>
2159
2160             <para>One exception, however: <literal>make
2161             maintainer-clean</literal> should not delete
2162             <filename>configure</filename> even if
2163             <filename>configure</filename> can be remade using a rule
2164             in the <filename>Makefile</filename>. More generally,
2165             <literal>make maintainer-clean</literal> should not delete
2166             anything that needs to exist in order to run
2167             <filename>configure</filename> and then begin to build the
2168             program.</para>
2169           </listitem>
2170         </varlistentry>
2171
2172         <varlistentry>
2173           <term><literal>check</literal></term>
2174           <listitem>
2175             <para>run the test suite.</para>
2176           </listitem>
2177         </varlistentry>
2178       </variablelist>
2179
2180       <para>All of these standard targets automatically recurse into
2181       sub-directories.  Certain other standard targets do not:</para>
2182
2183       <variablelist>
2184         <varlistentry>
2185           <term><literal>configure</literal></term>
2186           <listitem>
2187             <para>is only available in the root directory
2188             <constant>&dollar;(FPTOOLS&lowbar;TOP)</constant>; it has
2189             been discussed in <xref
2190             linkend="sec-build-config"/>.</para>
2191           </listitem>
2192         </varlistentry>
2193
2194         <varlistentry>
2195           <term><literal>depend</literal></term>
2196           <listitem>
2197             <para>make a <filename>.depend</filename> file in each
2198             directory that needs it. This <filename>.depend</filename>
2199             file contains mechanically-generated dependency
2200             information; for example, suppose a directory contains a
2201             Haskell source module <filename>Foo.lhs</filename> which
2202             imports another module <literal>Baz</literal>.  Then the
2203             generated <filename>.depend</filename> file will contain
2204             the dependency:</para>
2205
2206 <programlisting>Foo.o : Baz.hi</programlisting>
2207
2208             <para>which says that the object file
2209             <filename>Foo.o</filename> depends on the interface file
2210             <filename>Baz.hi</filename> generated by compiling module
2211             <literal>Baz</literal>.  The <filename>.depend</filename>
2212             file is automatically included by every Makefile.</para>
2213           </listitem>
2214         </varlistentry>
2215
2216         <varlistentry>
2217           <term><literal>binary-dist</literal></term>
2218           <listitem>
2219             <para>make a binary distribution.  This is the target we
2220             use to build the binary distributions of GHC and
2221             Happy.</para>
2222           </listitem>
2223         </varlistentry>
2224
2225         <varlistentry>
2226           <term><literal>dist</literal></term>
2227           <listitem>
2228             <para>make a source distribution.  Note that this target
2229             does &ldquo;make distclean&rdquo; as part of its work;
2230             don't use it if you want to keep what you've built.</para>
2231           </listitem>
2232         </varlistentry>
2233       </variablelist>
2234
2235       <para>Most <filename>Makefile</filename>s have targets other
2236       than these.  You can discover them by looking in the
2237       <filename>Makefile</filename> itself.</para>
2238     </sect2>
2239
2240     <sect2>
2241       <title>Using a project from the build tree</title> 
2242
2243       <para>If you want to build GHC (say) and just use it direct from
2244       the build tree without doing <literal>make install</literal>
2245       first, you can run the in-place driver script:
2246       <filename>ghc/compiler/ghc-inplace</filename>.</para>
2247
2248       <para> Do <emphasis>NOT</emphasis> use
2249       <filename>ghc/compiler/ghc</filename>, or
2250       <filename>ghc/compiler/ghc-6.xx</filename>, as these are the
2251       scripts intended for installation, and contain hard-wired paths
2252       to the installed libraries, rather than the libraries in the
2253       build tree.</para>
2254
2255       <para>Happy can similarly be run from the build tree, using
2256       <filename>happy/src/happy-inplace</filename>, and similarly for
2257       Alex and Haddock.</para>
2258     </sect2>
2259
2260     <sect2>
2261       <title>Fast Making</title>
2262
2263       <indexterm><primary>fastmake</primary></indexterm>
2264       <indexterm><primary>dependencies, omitting</primary></indexterm>
2265       <indexterm><primary>FAST, makefile variable</primary></indexterm>
2266
2267       <para>Sometimes the dependencies get in the way: if you've made
2268       a small change to one file, and you're absolutely sure that it
2269       won't affect anything else, but you know that
2270       <command>make</command> is going to rebuild everything anyway,
2271       the following hack may be useful:</para>
2272
2273 <screen>$ gmake FAST=YES</screen>
2274
2275       <para>This tells the make system to ignore dependencies and just
2276       build what you tell it to.  In other words, it's equivalent to
2277       temporarily removing the <filename>.depend</filename> file in
2278       the current directory (where <command>mkdependHS</command> and
2279       friends store their dependency information).</para>
2280
2281       <para>A bit of history: GHC used to come with a
2282       <command>fastmake</command> script that did the above job, but
2283       GNU make provides the features we need to do it without
2284       resorting to a script.  Also, we've found that fastmaking is
2285       less useful since the advent of GHC's recompilation checker (see
2286       the User's Guide section on "Separate Compilation").</para>
2287     </sect2>
2288   </sect1>
2289
2290   <sect1 id="sec-makefile-arch">
2291     <title>The <filename>Makefile</filename> architecture</title>
2292     <indexterm><primary>makefile architecture</primary></indexterm>
2293
2294     <para><command>make</command> is great if everything
2295     works&mdash;you type <command>gmake install</command> and lo! the
2296     right things get compiled and installed in the right places.  Our
2297     goal is to make this happen often, but somehow it often doesn't;
2298     instead some weird error message eventually emerges from the
2299     bowels of a directory you didn't know existed.</para>
2300
2301     <para>The purpose of this section is to give you a road-map to
2302     help you figure out what is going right and what is going
2303     wrong.</para>
2304
2305     <sect2>
2306       <title>Debugging</title>
2307       
2308       <para>Debugging <filename>Makefile</filename>s is something of a
2309       black art, but here's a couple of tricks that we find
2310       particularly useful.  The following command allows you to see
2311       the contents of any make variable in the context of the current
2312       <filename>Makefile</filename>:</para>
2313
2314 <screen>$ make show VALUE=HS_SRCS</screen>
2315
2316       <para>where you can replace <literal>HS_SRCS</literal> with the
2317       name of any variable you wish to see the value of.</para>
2318       
2319       <para>GNU make has a <option>-d</option> option which generates
2320       a dump of the decision procedure used to arrive at a conclusion
2321       about which files should be recompiled.  Sometimes useful for
2322       tracking down problems with superfluous or missing
2323       recompilations.</para>
2324     </sect2>
2325
2326     <sect2>
2327       <title>A small project</title>
2328
2329       <para>To get started, let us look at the
2330       <filename>Makefile</filename> for an imaginary small
2331       <literal>fptools</literal> project, <literal>small</literal>.
2332       Each project in <literal>fptools</literal> has its own directory
2333       in <constant>FPTOOLS&lowbar;TOP</constant>, so the
2334       <literal>small</literal> project will have its own directory
2335       <constant>FPOOLS&lowbar;TOP/small/</constant>.  Inside the
2336       <filename>small/</filename> directory there will be a
2337       <filename>Makefile</filename>, looking something like
2338       this:</para>
2339
2340 <indexterm><primary>Makefile, minimal</primary></indexterm>
2341
2342 <programlisting># Makefile for fptools project "small"
2343
2344 TOP = ..
2345 include $(TOP)/mk/boilerplate.mk
2346
2347 SRCS = $(wildcard *.lhs) $(wildcard *.c)
2348 HS_PROG = small
2349
2350 include $(TOP)/target.mk</programlisting>
2351
2352       <para>this <filename>Makefile</filename> has three
2353       sections:</para>
2354
2355       <orderedlist>
2356         <listitem>
2357           <para>The first section includes
2358 <footnote>
2359 <para>
2360 One of the most important
2361 features of GNU <command>make</command> that we use is the ability for a <filename>Makefile</filename> to
2362 include another named file, very like <command>cpp</command>'s <literal>&num;include</literal>
2363 directive.
2364 </para>
2365 </footnote>
2366
2367           a file of &ldquo;boilerplate&rdquo; code from the level
2368           above (which in this case will be
2369           <filename>FPTOOLS&lowbar;TOP/mk/boilerplate.mk</filename><indexterm><primary>boilerplate.mk</primary></indexterm>).
2370           As its name suggests, <filename>boilerplate.mk</filename>
2371           consists of a large quantity of standard
2372           <filename>Makefile</filename> code.  We discuss this
2373           boilerplate in more detail in <xref linkend="sec-boiler"/>.
2374           <indexterm><primary>include, directive in
2375           Makefiles</primary></indexterm> <indexterm><primary>Makefile
2376           inclusion</primary></indexterm></para>
2377
2378           <para>Before the <literal>include</literal> statement, you
2379           must define the <command>make</command> variable
2380           <constant>TOP</constant><indexterm><primary>TOP</primary></indexterm>
2381           to be the directory containing the <filename>mk</filename>
2382           directory in which the <filename>boilerplate.mk</filename>
2383           file is.  It is <emphasis>not</emphasis> OK to simply say</para>
2384
2385 <programlisting>include ../mk/boilerplate.mk  # NO NO NO</programlisting>
2386
2387
2388           <para>Why?  Because the <filename>boilerplate.mk</filename>
2389           file needs to know where it is, so that it can, in turn,
2390           <literal>include</literal> other files.  (Unfortunately,
2391           when an <literal>include</literal>d file does an
2392           <literal>include</literal>, the filename is treated relative
2393           to the directory in which <command>gmake</command> is being
2394           run, not the directory in which the
2395           <literal>include</literal>d sits.)  In general,
2396           <emphasis>every file <filename>foo.mk</filename> assumes
2397           that
2398           <filename>&dollar;(TOP)/mk/foo.mk</filename>
2399           refers to itself.</emphasis> It is up to the
2400           <filename>Makefile</filename> doing the
2401           <literal>include</literal> to ensure this is the case.</para>
2402
2403           <para>Files intended for inclusion in other
2404           <filename>Makefile</filename>s are written to have the
2405           following property: <emphasis>after
2406           <filename>foo.mk</filename> is <literal>include</literal>d,
2407           it leaves <constant>TOP</constant> containing the same value
2408           as it had just before the <literal>include</literal>
2409           statement</emphasis>.  In our example, this invariant
2410           guarantees that the <literal>include</literal> for
2411           <filename>target.mk</filename> will look in the same
2412           directory as that for <filename>boilerplate.mk</filename>.</para>
2413         </listitem>
2414
2415         <listitem>
2416           <para> The second section defines the following standard
2417           <command>make</command> variables:
2418           <constant>SRCS</constant><indexterm><primary>SRCS</primary></indexterm>
2419           (the source files from which is to be built), and
2420           <constant>HS&lowbar;PROG</constant><indexterm><primary>HS&lowbar;PROG</primary></indexterm>
2421           (the executable binary to be built).  We will discuss in
2422           more detail what the &ldquo;standard variables&rdquo; are,
2423           and how they affect what happens, in <xref
2424           linkend="sec-targets"/>.</para>
2425
2426           <para>The definition for <constant>SRCS</constant> uses the
2427           useful GNU <command>make</command> construct
2428           <literal>&dollar;(wildcard&nbsp;$pat$)</literal><indexterm><primary>wildcard</primary></indexterm>,
2429           which expands to a list of all the files matching the
2430           pattern <literal>pat</literal> in the current directory.  In
2431           this example, <constant>SRCS</constant> is set to the list
2432           of all the <filename>.lhs</filename> and
2433           <filename>.c</filename> files in the directory.  (Let's
2434           suppose there is one of each, <filename>Foo.lhs</filename>
2435           and <filename>Baz.c</filename>.)</para>
2436         </listitem>
2437
2438         <listitem>
2439           <para>The last section includes a second file of standard
2440           code, called
2441           <filename>target.mk</filename><indexterm><primary>target.mk</primary></indexterm>.
2442           It contains the rules that tell <command>gmake</command> how
2443           to make the standard targets (<xref
2444           linkend="sec-standard-targets"/>).  Why, you ask, can't this
2445           standard code be part of
2446           <filename>boilerplate.mk</filename>?  Good question.  We
2447           discuss the reason later, in <xref
2448           linkend="sec-boiler-arch"/>.</para>
2449
2450           <para>You do not <emphasis>have</emphasis> to
2451           <literal>include</literal> the
2452           <filename>target.mk</filename> file.  Instead, you can write
2453           rules of your own for all the standard targets.  Usually,
2454           though, you will find quite a big payoff from using the
2455           canned rules in <filename>target.mk</filename>; the price
2456           tag is that you have to understand what canned rules get
2457           enabled, and what they do (<xref
2458           linkend="sec-targets"/>).</para>
2459         </listitem>
2460       </orderedlist>
2461
2462       <para>In our example <filename>Makefile</filename>, most of the
2463       work is done by the two <literal>include</literal>d files.  When
2464       you say <command>gmake all</command>, the following things
2465       happen:</para>
2466
2467       <itemizedlist>
2468         <listitem>
2469           <para><command>gmake</command> figures out that the object
2470           files are <filename>Foo.o</filename> and
2471           <filename>Baz.o</filename>.</para>
2472         </listitem>
2473
2474         <listitem>
2475           <para>It uses a boilerplate pattern rule to compile
2476           <filename>Foo.lhs</filename> to <filename>Foo.o</filename>
2477           using a Haskell compiler.  (Which one?  That is set in the
2478           build configuration.)</para>
2479         </listitem>
2480
2481         <listitem>
2482           <para>It uses another standard pattern rule to compile
2483           <filename>Baz.c</filename> to <filename>Baz.o</filename>,
2484           using a C compiler.  (Ditto.)</para>
2485         </listitem>
2486
2487         <listitem>
2488           <para>It links the resulting <filename>.o</filename> files
2489           together to make <literal>small</literal>, using the Haskell
2490           compiler to do the link step.  (Why not use
2491           <command>ld</command>?  Because the Haskell compiler knows
2492           what standard libraries to link in.  How did
2493           <command>gmake</command> know to use the Haskell compiler to
2494           do the link, rather than the C compiler?  Because we set the
2495           variable <constant>HS&lowbar;PROG</constant> rather than
2496           <constant>C&lowbar;PROG</constant>.)</para>
2497         </listitem>
2498       </itemizedlist>
2499
2500       <para>All <filename>Makefile</filename>s should follow the above
2501       three-section format.</para>
2502     </sect2>
2503
2504     <sect2>
2505       <title>A larger project</title>
2506
2507       <para>Larger projects are usually structured into a number of
2508       sub-directories, each of which has its own
2509       <filename>Makefile</filename>.  (In very large projects, this
2510       sub-structure might be iterated recursively, though that is
2511       rare.)  To give you the idea, here's part of the directory
2512       structure for the (rather large) GHC project:</para>
2513
2514 <programlisting>$(FPTOOLS_TOP)/ghc/
2515   Makefile
2516   mk/
2517     boilerplate.mk
2518     rules.mk
2519    docs/
2520     Makefile
2521     ...source files for documentation...
2522    driver/
2523     Makefile
2524     ...source files for driver...
2525    compiler/
2526     Makefile
2527     parser/...source files for parser...
2528     renamer/...source files for renamer...
2529     ...etc...</programlisting>
2530
2531       <para>The sub-directories <filename>docs</filename>,
2532       <filename>driver</filename>, <filename>compiler</filename>, and
2533       so on, each contains a sub-component of GHC, and each has its
2534       own <filename>Makefile</filename>.  There must also be a
2535       <filename>Makefile</filename> in
2536       <filename>&dollar;(FPTOOLS&lowbar;TOP)/ghc</filename>.
2537       It does most of its work by recursively invoking
2538       <command>gmake</command> on the <filename>Makefile</filename>s
2539       in the sub-directories.  We say that
2540       <filename>ghc/Makefile</filename> is a <emphasis>non-leaf
2541       <filename>Makefile</filename></emphasis>, because it does little
2542       except organise its children, while the
2543       <filename>Makefile</filename>s in the sub-directories are all
2544       <emphasis>leaf <filename>Makefile</filename>s</emphasis>.  (In
2545       principle the sub-directories might themselves contain a
2546       non-leaf <filename>Makefile</filename> and several
2547       sub-sub-directories, but that does not happen in GHC.)</para>
2548
2549       <para>The <filename>Makefile</filename> in
2550       <filename>ghc/compiler</filename> is considered a leaf
2551       <filename>Makefile</filename> even though the
2552       <filename>ghc/compiler</filename> has sub-directories, because
2553       these sub-directories do not themselves have
2554       <filename>Makefile</filename>s in them.  They are just used to
2555       structure the collection of modules that make up GHC, but all
2556       are managed by the single <filename>Makefile</filename> in
2557       <filename>ghc/compiler</filename>.</para>
2558
2559       <para>You will notice that <filename>ghc/</filename> also
2560       contains a directory <filename>ghc/mk/</filename>.  It contains
2561       GHC-specific <filename>Makefile</filename> boilerplate code.
2562       More precisely:</para>
2563
2564       <itemizedlist>
2565         <listitem>
2566           <para><filename>ghc/mk/boilerplate.mk</filename> is included
2567           at the top of <filename>ghc/Makefile</filename>, and of all
2568           the leaf <filename>Makefile</filename>s in the
2569           sub-directories.  It in turn <literal>include</literal>s the
2570           main boilerplate file
2571           <filename>mk/boilerplate.mk</filename>.</para>
2572         </listitem>
2573
2574         <listitem>
2575           <para><filename>ghc/mk/target.mk</filename> is
2576           <literal>include</literal>d at the bottom of
2577           <filename>ghc/Makefile</filename>, and of all the leaf
2578           <filename>Makefile</filename>s in the sub-directories.  It
2579           in turn <literal>include</literal>s the file
2580           <filename>mk/target.mk</filename>.</para>
2581         </listitem>
2582       </itemizedlist>
2583
2584       <para>So these two files are the place to look for GHC-wide
2585       customisation of the standard boilerplate.</para>
2586     </sect2>
2587
2588     <sect2 id="sec-boiler-arch">
2589       <title>Boilerplate architecture</title>
2590       <indexterm><primary>boilerplate architecture</primary></indexterm>
2591
2592       <para>Every <filename>Makefile</filename> includes a
2593       <filename>boilerplate.mk</filename><indexterm><primary>boilerplate.mk</primary></indexterm>
2594       file at the top, and
2595       <filename>target.mk</filename><indexterm><primary>target.mk</primary></indexterm>
2596       file at the bottom.  In this section we discuss what is in these
2597       files, and why there have to be two of them.  In general:</para>
2598
2599       <itemizedlist>
2600         <listitem>
2601           <para><filename>boilerplate.mk</filename> consists of:</para>
2602
2603           <itemizedlist>
2604             <listitem>
2605               <para><emphasis>Definitions of millions of
2606               <command>make</command> variables</emphasis> that
2607               collectively specify the build configuration.  Examples:
2608               <constant>HC&lowbar;OPTS</constant><indexterm><primary>HC&lowbar;OPTS</primary></indexterm>,
2609               the options to feed to the Haskell compiler;
2610               <constant>NoFibSubDirs</constant><indexterm><primary>NoFibSubDirs</primary></indexterm>,
2611               the sub-directories to enable within the
2612               <literal>nofib</literal> project;
2613               <constant>GhcWithHc</constant><indexterm><primary>GhcWithHc</primary></indexterm>,
2614               the name of the Haskell compiler to use when compiling
2615               GHC in the <literal>ghc</literal> project.</para>
2616             </listitem>
2617
2618             <listitem>
2619               <para><emphasis>Standard pattern rules</emphasis> that
2620               tell <command>gmake</command> how to construct one file
2621               from another.</para>
2622             </listitem>
2623           </itemizedlist>
2624
2625           <para><filename>boilerplate.mk</filename> needs to be
2626           <literal>include</literal>d at the <emphasis>top</emphasis>
2627           of each <filename>Makefile</filename>, so that the user can
2628           replace the boilerplate definitions or pattern rules by
2629           simply giving a new definition or pattern rule in the
2630           <filename>Makefile</filename>.  <command>gmake</command>
2631           simply takes the last definition as the definitive one.</para>
2632
2633           <para>Instead of <emphasis>replacing</emphasis> boilerplate
2634           definitions, it is also quite common to
2635           <emphasis>augment</emphasis> them. For example, a
2636           <filename>Makefile</filename> might say:</para>
2637
2638 <programlisting>SRC_HC_OPTS += -O</programlisting>
2639
2640           <para>thereby adding &ldquo;<option>-O</option>&rdquo; to
2641           the end of
2642           <constant>SRC&lowbar;HC&lowbar;OPTS</constant><indexterm><primary>SRC&lowbar;HC&lowbar;OPTS</primary></indexterm>.</para>
2643         </listitem>
2644
2645         <listitem>
2646           <para><filename>target.mk</filename> contains
2647           <command>make</command> rules for the standard targets
2648           described in <xref linkend="sec-standard-targets"/>.  These
2649           rules are selectively included, depending on the setting of
2650           certain <command>make</command> variables.  These variables
2651           are usually set in the middle section of the
2652           <filename>Makefile</filename> between the two
2653           <literal>include</literal>s.</para>
2654
2655           <para><filename>target.mk</filename> must be included at the
2656           end (rather than being part of
2657           <filename>boilerplate.mk</filename>) for several tiresome
2658           reasons:</para>
2659
2660           <itemizedlist>
2661             <listitem>
2662
2663               <para><command>gmake</command> commits target and
2664               dependency lists earlier than it should.  For example,
2665               <filename>target.mk</filename> has a rule that looks
2666               like this:</para>
2667
2668 <programlisting>$(HS_PROG) : $(OBJS)
2669       $(HC) $(LD_OPTS) $&#60; -o $@</programlisting>
2670
2671               <para>If this rule was in
2672               <filename>boilerplate.mk</filename> then
2673               <constant>&dollar;(HS&lowbar;PROG)</constant><indexterm><primary>HS&lowbar;PROG</primary></indexterm>
2674               and
2675               <constant>&dollar;(OBJS)</constant><indexterm><primary>OBJS</primary></indexterm>
2676               would not have their final values at the moment
2677               <command>gmake</command> encountered the rule.  Alas,
2678               <command>gmake</command> takes a snapshot of their
2679               current values, and wires that snapshot into the rule.
2680               (In contrast, the commands executed when the rule
2681               &ldquo;fires&rdquo; are only substituted at the moment
2682               of firing.)  So, the rule must follow the definitions
2683               given in the <filename>Makefile</filename> itself.</para>
2684             </listitem>
2685
2686             <listitem>
2687               <para>Unlike pattern rules, ordinary rules cannot be
2688               overriden or replaced by subsequent rules for the same
2689               target (at least, not without an error message).
2690               Including ordinary rules in
2691               <filename>boilerplate.mk</filename> would prevent the
2692               user from writing rules for specific targets in specific
2693               cases.</para>
2694             </listitem>
2695
2696             <listitem>
2697               <para>There are a couple of other reasons I've
2698               forgotten, but it doesn't matter too much.</para>
2699             </listitem>
2700           </itemizedlist>
2701         </listitem>
2702       </itemizedlist>
2703     </sect2>
2704
2705     <sect2 id="sec-boiler">
2706       <title>The main <filename>mk/boilerplate.mk</filename> file</title>
2707       <indexterm><primary>boilerplate.mk</primary></indexterm>
2708
2709       <para>If you look at
2710       <filename>&dollar;(FPTOOLS&lowbar;TOP)/mk/boilerplate.mk</filename>
2711       you will find that it consists of the following sections, each
2712       held in a separate file:</para>
2713
2714       <variablelist>
2715         <varlistentry>
2716           <term><filename>config.mk</filename>
2717             <indexterm><primary>config.mk</primary></indexterm>
2718           </term>
2719           <listitem>
2720             <para>is the build configuration file we discussed at
2721             length in <xref linkend="sec-build-config"/>.</para>
2722           </listitem>
2723         </varlistentry>
2724
2725         <varlistentry>
2726           <term><filename>paths.mk</filename>
2727             <indexterm><primary>paths.mk</primary></indexterm>
2728           </term>
2729           <listitem>
2730             <para>defines <command>make</command> variables for
2731             pathnames and file lists.  This file contains code for
2732             automatically compiling lists of source files and deriving
2733             lists of object files from those.  The results can be
2734             overriden in the <filename>Makefile</filename>, but in
2735             most cases the automatic setup should do the right
2736             thing.</para>
2737             
2738             <para>The following variables may be set in the
2739             <filename>Makefile</filename> to affect how the automatic
2740             source file search is done:</para>
2741
2742             <variablelist>
2743               <varlistentry>
2744                 <term><literal>ALL_DIRS</literal>
2745                   <indexterm><primary><literal>ALL_DIRS</literal></primary></indexterm>
2746                 </term>
2747                 <listitem>
2748                   <para>Set to a list of directories to search in
2749                   addition to the current directory for source
2750                   files.</para>
2751                 </listitem>
2752               </varlistentry>
2753
2754               <varlistentry>
2755                 <term><literal>EXCLUDED_SRCS</literal>
2756                   <indexterm><primary><literal>EXCLUDED_SRCS</literal></primary></indexterm>
2757                 </term>
2758                 <listitem>
2759                   <para>Set to a list of source files (relative to the
2760                   current directory) to omit from the automatic
2761                   search.  The source searching machinery is clever
2762                   enough to know that if you exclude a source file
2763                   from which other sources are derived, then the
2764                   derived sources should also be excluded.  For
2765                   example, if you set <literal>EXCLUDED_SRCS</literal>
2766                   to include <filename>Foo.y</filename>, then
2767                   <filename>Foo.hs</filename> will also be
2768                   excluded.</para>
2769                 </listitem>
2770               </varlistentry>
2771
2772               <varlistentry>
2773                 <term><literal>EXTRA_SRCS</literal>
2774                   <indexterm><primary><literal>EXTRA_SRCS</literal></primary></indexterm>
2775                 </term>
2776                   <listitem>
2777                   <para>Set to a list of extra source files (perhaps
2778                   in directories not listed in
2779                   <literal>ALL_DIRS</literal>) that should be
2780                   considered.</para>
2781                 </listitem>
2782               </varlistentry>
2783             </variablelist>
2784
2785             <para>The results of the automatic source file search are
2786             placed in the following make variables:</para>
2787
2788             <variablelist>
2789               <varlistentry>
2790                 <term><literal>SRCS</literal>
2791                   <indexterm><primary><literal>SRCS</literal></primary></indexterm>
2792                 </term>
2793                 <listitem>
2794                   <para>All source files found, sorted and without
2795                   duplicates, including those which might not exist
2796                   yet but will be derived from other existing sources.
2797                   <literal>SRCS</literal> <emphasis>can</emphasis> be
2798                   overriden if necessary, in which case the variables
2799                   below will follow suit.</para>
2800                 </listitem>
2801               </varlistentry>
2802
2803               <varlistentry>
2804                 <term><literal>HS_SRCS</literal>
2805                   <indexterm><primary><literal>HS_SRCS</literal></primary></indexterm>
2806                 </term>
2807                 <listitem>
2808                   <para>all Haskell source files in the current
2809                   directory, including those derived from other source
2810                   files (eg. Happy sources also give rise to Haskell
2811                   sources).</para>
2812                 </listitem>
2813               </varlistentry>
2814
2815               <varlistentry>
2816                 <term><literal>HS_OBJS</literal>
2817                   <indexterm><primary><literal>HS_OBJS</literal></primary></indexterm>
2818                 </term>
2819                 <listitem>
2820                   <para>Object files derived from
2821                   <literal>HS_SRCS</literal>.</para>
2822                 </listitem>
2823               </varlistentry>
2824
2825               <varlistentry>
2826                 <term><literal>HS_IFACES</literal>
2827                   <indexterm><primary><literal>HS_IFACES</literal></primary></indexterm>
2828                 </term>
2829                 <listitem>
2830                   <para>Interface files (<literal>.hi</literal> files)
2831                   derived from <literal>HS_SRCS</literal>.</para>
2832                 </listitem>
2833               </varlistentry>
2834
2835               <varlistentry>
2836                 <term><literal>C_SRCS</literal>
2837                 <indexterm><primary><literal>C_SRCS</literal></primary></indexterm>
2838                 </term>
2839                 <listitem>
2840                   <para>All C source files found.</para>
2841                 </listitem>
2842               </varlistentry>
2843
2844               <varlistentry>
2845                 <term><literal>C_OBJS</literal>
2846                 <indexterm><primary><literal>C_OBJS</literal></primary></indexterm>
2847                 </term>
2848                 <listitem>
2849                   <para>Object files derived from
2850                   <literal>C_SRCS</literal>.</para>
2851                 </listitem>
2852               </varlistentry>
2853
2854               <varlistentry>
2855                 <term><literal>SCRIPT_SRCS</literal>
2856                 <indexterm><primary><literal>SCRIPT_SRCS</literal></primary></indexterm>
2857                 </term>
2858                 <listitem>
2859                   <para>All script source files found
2860                   (<literal>.lprl</literal> files).</para>
2861                 </listitem>
2862               </varlistentry>
2863
2864               <varlistentry>
2865                 <term><literal>SCRIPT_OBJS</literal>
2866                 <indexterm><primary><literal>SCRIPT_OBJS</literal></primary></indexterm>
2867                 </term>
2868                 <listitem>
2869                   <para><quote>object</quote> files derived from
2870                   <literal>SCRIPT_SRCS</literal>
2871                   (<literal>.prl</literal> files).</para>
2872                 </listitem>
2873               </varlistentry>
2874
2875               <varlistentry>
2876                 <term><literal>HSC_SRCS</literal>
2877                 <indexterm><primary><literal>HSC_SRCS</literal></primary></indexterm>
2878                 </term>
2879                 <listitem>
2880                   <para>All <literal>hsc2hs</literal> source files
2881                   (<literal>.hsc</literal> files).</para>
2882                 </listitem>
2883               </varlistentry>
2884
2885               <varlistentry>
2886                 <term><literal>HAPPY_SRCS</literal>
2887                 <indexterm><primary><literal>HAPPY_SRCS</literal></primary></indexterm>
2888                 </term>
2889                 <listitem>
2890                   <para>All <literal>happy</literal> source files
2891                   (<literal>.y</literal> or <literal>.hy</literal> files).</para>
2892                 </listitem>
2893               </varlistentry>
2894
2895               <varlistentry>
2896                 <term><literal>OBJS</literal>
2897                 <indexterm><primary>OBJS</primary></indexterm>
2898                 </term>
2899                 <listitem>
2900                   <para>the concatenation of
2901                   <literal>&dollar;(HS_OBJS)</literal>,
2902                   <literal>&dollar;(C_OBJS)</literal>, and
2903                   <literal>&dollar;(SCRIPT_OBJS)</literal>.</para>
2904                 </listitem>
2905               </varlistentry>
2906             </variablelist>
2907
2908             <para>Any or all of these definitions can easily be
2909             overriden by giving new definitions in your
2910             <filename>Makefile</filename>.</para>
2911
2912             <para>What, exactly, does <filename>paths.mk</filename>
2913             consider a <quote>source file</quote> to be?  It's based
2914             on the file's suffix (e.g. <filename>.hs</filename>,
2915             <filename>.lhs</filename>, <filename>.c</filename>,
2916             <filename>.hy</filename>, etc), but this is the kind of
2917             detail that changes, so rather than enumerate the source
2918             suffices here the best thing to do is to look in
2919             <filename>paths.mk</filename>.</para>
2920           </listitem>
2921         </varlistentry>
2922
2923         <varlistentry>
2924           <term><filename>opts.mk</filename>
2925             <indexterm><primary>opts.mk</primary></indexterm>
2926           </term>
2927           <listitem>
2928             <para>defines <command>make</command> variables for option
2929             strings to pass to each program. For example, it defines
2930             <constant>HC&lowbar;OPTS</constant><indexterm><primary>HC&lowbar;OPTS</primary></indexterm>,
2931             the option strings to pass to the Haskell compiler.  See
2932             <xref linkend="sec-suffix"/>.</para>
2933           </listitem>
2934         </varlistentry>
2935
2936         <varlistentry>
2937           <term><filename>suffix.mk</filename>
2938             <indexterm><primary>suffix.mk</primary></indexterm>
2939           </term>
2940           <listitem>
2941             <para>defines standard pattern rules&mdash;see <xref
2942             linkend="sec-suffix"/>.</para>
2943           </listitem>
2944         </varlistentry>
2945       </variablelist>
2946
2947       <para>Any of the variables and pattern rules defined by the
2948       boilerplate file can easily be overridden in any particular
2949       <filename>Makefile</filename>, because the boilerplate
2950       <literal>include</literal> comes first.  Definitions after this
2951       <literal>include</literal> directive simply override the default
2952       ones in <filename>boilerplate.mk</filename>.</para>
2953     </sect2>
2954
2955     <sect2 id="sec-platforms">
2956       <title>Platform settings</title>
2957       <indexterm><primary>Platform settings</primary>
2958       </indexterm>
2959
2960       <para>There are three platforms of interest when building GHC:</para>
2961       
2962       <variablelist>
2963         <varlistentry>
2964           <term>The <emphasis>build</emphasis> platform</term>
2965           <listitem>
2966             <para>The platform on which we are doing this build.</para>
2967           </listitem>
2968         </varlistentry>
2969
2970         <varlistentry>
2971           <term>The <emphasis>host</emphasis> platform</term>
2972           <listitem>
2973             <para>The platform on which these binaries will run.</para>
2974           </listitem>
2975         </varlistentry>
2976
2977         <varlistentry>
2978           <term>The <emphasis>target</emphasis> platform</term>
2979           <listitem>
2980             <para>The platform for which this compiler will generate code.</para>
2981           </listitem>
2982         </varlistentry>
2983       </variablelist>
2984       
2985       <para>These platforms are set when running the
2986         <literal>configure</literal> script, using the
2987         <option>--build</option>, <option>--host</option>, and
2988         <option>--target</option> options.  The <filename>mk/config.mk</filename>
2989         file defines several symbols related to the platform settings (see
2990         <filename>mk/config.mk</filename> for details).</para>
2991
2992       <para>We don't currently support build &amp; host being different, because
2993         the build process creates binaries that are both run during the build,
2994         and also installed.</para>
2995
2996       <para>If host and target are different, then we are building a
2997         cross-compiler.  For GHC, this means a compiler
2998         which will generate intermediate .hc files to port to the target
2999         architecture for bootstrapping.  The libraries and stage 2 compiler
3000         will be built as HC files for the target system (see <xref
3001           linkend="sec-porting-ghc" /> for details.</para>
3002
3003       <para>More details on when to use BUILD, HOST or TARGET can be found in
3004         the comments in <filename>config.mk</filename>.</para>
3005     </sect2>
3006
3007     <sect2 id="sec-suffix">
3008       <title>Pattern rules and options</title>
3009       <indexterm><primary>Pattern rules</primary></indexterm>
3010
3011       <para>The file
3012       <filename>suffix.mk</filename><indexterm><primary>suffix.mk</primary></indexterm>
3013       defines standard <emphasis>pattern rules</emphasis> that say how
3014       to build one kind of file from another, for example, how to
3015       build a <filename>.o</filename> file from a
3016       <filename>.c</filename> file.  (GNU <command>make</command>'s
3017       <emphasis>pattern rules</emphasis> are more powerful and easier
3018       to use than Unix <command>make</command>'s <emphasis>suffix
3019       rules</emphasis>.)</para>
3020
3021       <para>Almost all the rules look something like this:</para>
3022
3023 <programlisting>%.o : %.c
3024       $(RM) $@
3025       $(CC) $(CC_OPTS) -c $&#60; -o $@</programlisting>
3026
3027       <para>Here's how to understand the rule.  It says that
3028       <emphasis>something</emphasis><filename>.o</filename> (say
3029       <filename>Foo.o</filename>) can be built from
3030       <emphasis>something</emphasis><filename>.c</filename>
3031       (<filename>Foo.c</filename>), by invoking the C compiler (path
3032       name held in <constant>&dollar;(CC)</constant>), passing to it
3033       the options <constant>&dollar;(CC&lowbar;OPTS)</constant> and
3034       the rule's dependent file of the rule
3035       <literal>&dollar;&lt;</literal> (<filename>Foo.c</filename> in
3036       this case), and putting the result in the rule's target
3037       <literal>&dollar;@</literal> (<filename>Foo.o</filename> in this
3038       case).</para>
3039
3040       <para>Every program is held in a <command>make</command>
3041       variable defined in <filename>mk/config.mk</filename>&mdash;look
3042       in <filename>mk/config.mk</filename> for the complete list.  One
3043       important one is the Haskell compiler, which is called
3044       <constant>&dollar;(HC)</constant>.</para>
3045
3046       <para>Every program's options are are held in a
3047       <command>make</command> variables called
3048       <constant>&lt;prog&gt;&lowbar;OPTS</constant>.  the
3049       <constant>&lt;prog&gt;&lowbar;OPTS</constant> variables are
3050       defined in <filename>mk/opts.mk</filename>.  Almost all of them
3051       are defined like this:</para>
3052
3053 <programlisting>CC_OPTS = \
3054   $(SRC_CC_OPTS) $(WAY$(_way)_CC_OPTS) $($*_CC_OPTS) $(EXTRA_CC_OPTS)</programlisting>
3055
3056       <para>The four variables from which
3057        <constant>CC&lowbar;OPTS</constant> is built have the following
3058       meaning:</para>
3059
3060       <variablelist>
3061         <varlistentry>
3062           <term><constant>SRC&lowbar;CC&lowbar;OPTS</constant><indexterm><primary>SRC&lowbar;CC&lowbar;OPTS</primary></indexterm>:</term>
3063           <listitem>
3064             <para>options passed to all C compilations.</para>
3065           </listitem>
3066         </varlistentry>
3067
3068         <varlistentry>
3069           <term><constant>WAY&lowbar;&lt;way&gt;&lowbar;CC&lowbar;OPTS</constant>:</term>
3070           <listitem>
3071             <para>options passed to C compilations for way
3072             <literal>&lt;way&gt;</literal>. For example,
3073             <constant>WAY&lowbar;mp&lowbar;CC&lowbar;OPTS</constant>
3074             gives options to pass to the C compiler when compiling way
3075             <literal>mp</literal>.  The variable
3076             <constant>WAY&lowbar;CC&lowbar;OPTS</constant> holds
3077             options to pass to the C compiler when compiling the
3078             standard way.  (<xref linkend="sec-ways"/> dicusses
3079             multi-way compilation.)</para>
3080           </listitem>
3081         </varlistentry>
3082
3083         <varlistentry>
3084           <term><constant>&lt;module&gt;&lowbar;CC&lowbar;OPTS</constant>:</term>
3085           <listitem>
3086             <para>options to pass to the C compiler that are specific
3087             to module <literal>&lt;module&gt;</literal>.  For example,
3088             <constant>SMap&lowbar;CC&lowbar;OPTS</constant> gives the
3089             specific options to pass to the C compiler when compiling
3090             <filename>SMap.c</filename>.</para>
3091           </listitem>
3092         </varlistentry>
3093
3094         <varlistentry>
3095           <term><constant>EXTRA&lowbar;CC&lowbar;OPTS</constant><indexterm><primary>EXTRA&lowbar;CC&lowbar;OPTS</primary></indexterm>:</term>
3096           <listitem>
3097             <para>extra options to pass to all C compilations.  This
3098             is intended for command line use, thus:</para>
3099
3100 <screen>$ gmake libHS.a EXTRA_CC_OPTS="-v"</screen>
3101           </listitem>
3102         </varlistentry>
3103       </variablelist>
3104     </sect2>
3105
3106     <sect2 id="sec-targets">
3107       <title>The main <filename>mk/target.mk</filename> file</title>
3108       <indexterm><primary>target.mk</primary></indexterm>
3109
3110       <para><filename>target.mk</filename> contains canned rules for
3111       all the standard targets described in <xref
3112       linkend="sec-standard-targets"/>.  It is complicated by the fact
3113       that you don't want all of these rules to be active in every
3114       <filename>Makefile</filename>.  Rather than have a plethora of
3115       tiny files which you can include selectively, there is a single
3116       file, <filename>target.mk</filename>, which selectively includes
3117       rules based on whether you have defined certain variables in
3118       your <filename>Makefile</filename>.  This section explains what
3119       rules you get, what variables control them, and what the rules
3120       do.  Hopefully, you will also get enough of an idea of what is
3121       supposed to happen that you can read and understand any weird
3122       special cases yourself.</para>
3123
3124       <variablelist>
3125         <varlistentry>
3126           <term><constant>HS&lowbar;PROG</constant><indexterm><primary>HS&lowbar;PROG</primary></indexterm>.</term>
3127           <listitem>
3128             <para>If <constant>HS&lowbar;PROG</constant> is defined,
3129             you get rules with the following targets:</para>
3130
3131             <variablelist>
3132               <varlistentry>
3133                 <term><filename>HS&lowbar;PROG</filename><indexterm><primary>HS&lowbar;PROG</primary></indexterm></term>
3134                 <listitem>
3135                   <para>itself.  This rule links
3136                   <constant>&dollar;(OBJS)</constant> with the Haskell
3137                   runtime system to get an executable called
3138                   <constant>&dollar;(HS&lowbar;PROG)</constant>.</para>
3139                 </listitem>
3140               </varlistentry>
3141
3142               <varlistentry>
3143                 <term><literal>install</literal><indexterm><primary>install</primary></indexterm></term>
3144                 <listitem>
3145                   <para>installs
3146                   <constant>&dollar;(HS&lowbar;PROG)</constant> in
3147                   <constant>&dollar;(bindir)</constant>.</para>
3148                 </listitem>
3149               </varlistentry>
3150             </variablelist>
3151
3152           </listitem>
3153         </varlistentry>
3154
3155         <varlistentry>
3156           <term><constant>C&lowbar;PROG</constant><indexterm><primary>C&lowbar;PROG</primary></indexterm></term>
3157           <listitem>
3158             <para>is similar to <constant>HS&lowbar;PROG</constant>,
3159             except that the link step links
3160             <constant>&dollar;(C&lowbar;OBJS)</constant> with the C
3161             runtime system.</para>
3162           </listitem>
3163         </varlistentry>
3164
3165         <varlistentry>
3166           <term><constant>LIBRARY</constant><indexterm><primary>LIBRARY</primary></indexterm></term>
3167           <listitem>
3168             <para>is similar to <constant>HS&lowbar;PROG</constant>,
3169             except that it links
3170             <constant>&dollar;(LIB&lowbar;OBJS)</constant> to make the
3171             library archive <constant>&dollar;(LIBRARY)</constant>,
3172             and <literal>install</literal> installs it in
3173             <constant>&dollar;(libdir)</constant>.</para>
3174           </listitem>
3175         </varlistentry>
3176
3177         <varlistentry>
3178           <term><constant>LIB&lowbar;DATA</constant><indexterm><primary>LIB&lowbar;DATA</primary></indexterm></term>
3179           <listitem>
3180             <para>&hellip;</para>
3181           </listitem>
3182         </varlistentry>
3183
3184         <varlistentry>
3185           <term><constant>LIB&lowbar;EXEC</constant><indexterm><primary>LIB&lowbar;EXEC</primary></indexterm></term>
3186           <listitem>
3187             <para>&hellip;</para>
3188           </listitem>
3189         </varlistentry>
3190
3191         <varlistentry>
3192           <term><constant>HS&lowbar;SRCS</constant><indexterm><primary>HS&lowbar;SRCS</primary></indexterm>, <constant>C&lowbar;SRCS</constant><indexterm><primary>C&lowbar;SRCS</primary></indexterm>.</term>
3193           <listitem>
3194             <para>If <constant>HS&lowbar;SRCS</constant> is defined
3195             and non-empty, a rule for the target
3196             <literal>depend</literal> is included, which generates
3197             dependency information for Haskell programs.  Similarly
3198             for <constant>C&lowbar;SRCS</constant>.</para>
3199           </listitem>
3200         </varlistentry>
3201       </variablelist>
3202
3203       <para>All of these rules are &ldquo;double-colon&rdquo; rules,
3204       thus</para>
3205
3206 <programlisting>install :: $(HS_PROG)
3207       ...how to install it...</programlisting>
3208
3209       <para>GNU <command>make</command> treats double-colon rules as
3210       separate entities.  If there are several double-colon rules for
3211       the same target it takes each in turn and fires it if its
3212       dependencies say to do so.  This means that you can, for
3213       example, define both <constant>HS&lowbar;PROG</constant> and
3214       <constant>LIBRARY</constant>, which will generate two rules for
3215       <literal>install</literal>.  When you type <command>gmake
3216       install</command> both rules will be fired, and both the program
3217       and the library will be installed, just as you wanted.</para>
3218     </sect2>
3219
3220     <sect2 id="sec-subdirs">
3221       <title>Recursion</title>
3222       <indexterm><primary>recursion, in makefiles</primary></indexterm>
3223       <indexterm><primary>Makefile, recursing into subdirectories</primary></indexterm>
3224
3225       <para>In leaf <filename>Makefile</filename>s the variable
3226       <constant>SUBDIRS</constant><indexterm><primary>SUBDIRS</primary></indexterm>
3227       is undefined.  In non-leaf <filename>Makefile</filename>s,
3228       <constant>SUBDIRS</constant> is set to the list of
3229       sub-directories that contain subordinate
3230       <filename>Makefile</filename>s.  <emphasis>It is up to you to
3231       set <constant>SUBDIRS</constant> in the
3232       <filename>Makefile</filename>.</emphasis> There is no automation
3233       here&mdash;<constant>SUBDIRS</constant> is too important to
3234       automate.</para>
3235
3236       <para>When <constant>SUBDIRS</constant> is defined,
3237       <filename>target.mk</filename> includes a rather neat rule for
3238       the standard targets (<xref linkend="sec-standard-targets"/> that
3239       simply invokes <command>make</command> recursively in each of
3240       the sub-directories.</para>
3241
3242       <para><emphasis>These recursive invocations are guaranteed to
3243       occur in the order in which the list of directories is specified
3244       in <constant>SUBDIRS</constant>. </emphasis>This guarantee can
3245       be important.  For example, when you say <command>gmake
3246       boot</command> it can be important that the recursive invocation
3247       of <command>make boot</command> is done in one sub-directory
3248       (the include files, say) before another (the source files).
3249       Generally, put the most independent sub-directory first, and the
3250       most dependent last.</para>
3251     </sect2>
3252
3253     <sect2 id="sec-ways">
3254       <title>Way management</title>
3255       <indexterm><primary>way management</primary></indexterm>
3256
3257       <para>We sometimes want to build essentially the same system in
3258       several different &ldquo;ways&rdquo;.  For example, we want to build GHC's
3259       <literal>Prelude</literal> libraries with and without profiling,
3260       so that there is an appropriately-built library archive to link
3261       with when the user compiles his program.  It would be possible
3262       to have a completely separate build tree for each such &ldquo;way&rdquo;,
3263       but it would be horribly bureaucratic, especially since often
3264       only parts of the build tree need to be constructed in multiple
3265       ways.</para>
3266
3267       <para>Instead, the
3268       <filename>target.mk</filename><indexterm><primary>target.mk</primary></indexterm>
3269       contains some clever magic to allow you to build several
3270       versions of a system; and to control locally how many versions
3271       are built and how they differ.  This section explains the
3272       magic.</para>
3273
3274       <para>The files for a particular way are distinguished by
3275       munging the suffix.  The <quote>normal way</quote> is always
3276       built, and its files have the standard suffices
3277       <filename>.o</filename>, <filename>.hi</filename>, and so on.
3278       In addition, you can build one or more extra ways, each
3279       distinguished by a <emphasis>way tag</emphasis>.  The object
3280       files and interface files for one of these extra ways are
3281       distinguished by their suffix.  For example, way
3282       <literal>mp</literal> has files
3283       <filename>.mp&lowbar;o</filename> and
3284       <filename>.mp&lowbar;hi</filename>.  Library archives have their
3285       way tag the other side of the dot, for boring reasons; thus,
3286       <filename>libHS&lowbar;mp.a</filename>.</para>
3287
3288       <para>A <command>make</command> variable called
3289       <constant>way</constant> holds the current way tag.
3290       <emphasis><constant>way</constant> is only ever set on the
3291       command line of <command>gmake</command></emphasis> (usually in
3292       a recursive invocation of <command>gmake</command> by the
3293       system).  It is never set inside a
3294       <filename>Makefile</filename>.  So it is a global constant for
3295       any one invocation of <command>gmake</command>.  Two other
3296       <command>make</command> variables,
3297       <constant>way&lowbar;</constant> and
3298       <constant>&lowbar;way</constant> are immediately derived from
3299       <constant>&dollar;(way)</constant> and never altered.  If
3300       <constant>way</constant> is not set, then neither are
3301       <constant>way&lowbar;</constant> and
3302       <constant>&lowbar;way</constant>, and the invocation of
3303       <command>make</command> will build the <quote>normal
3304       way</quote>.  If <constant>way</constant> is set, then the other
3305       two variables are set in sympathy.  For example, if
3306       <constant>&dollar;(way)</constant> is &ldquo;<literal>mp</literal>&rdquo;,
3307       then <constant>way&lowbar;</constant> is set to
3308       &ldquo;<literal>mp&lowbar;</literal>&rdquo; and
3309       <constant>&lowbar;way</constant> is set to
3310       &ldquo;<literal>&lowbar;mp</literal>&rdquo;.  These three variables are
3311       then used when constructing file names.</para>
3312
3313       <para>So how does <command>make</command> ever get recursively
3314       invoked with <constant>way</constant> set?  There are two ways
3315       in which this happens:</para>
3316
3317       <itemizedlist>
3318         <listitem>
3319           <para>For some (but not all) of the standard targets, when
3320           in a leaf sub-directory, <command>make</command> is
3321           recursively invoked for each way tag in
3322           <constant>&dollar;(WAYS)</constant>.  You set
3323           <constant>WAYS</constant> in the
3324           <filename>Makefile</filename> to the list of way tags you
3325           want these targets built for.  The mechanism here is very
3326           much like the recursive invocation of
3327           <command>make</command> in sub-directories (<xref
3328           linkend="sec-subdirs"/>).  It is up to you to set
3329           <constant>WAYS</constant> in your
3330           <filename>Makefile</filename>; this is how you control what
3331           ways will get built.</para>
3332         </listitem>
3333
3334         <listitem>
3335           <para>For a useful collection of targets (such as
3336           <filename>libHS&lowbar;mp.a</filename>,
3337           <filename>Foo.mp&lowbar;o</filename>) there is a rule which
3338           recursively invokes <command>make</command> to make the
3339           specified target, setting the <constant>way</constant>
3340           variable.  So if you say <command>gmake
3341           Foo.mp&lowbar;o</command> you should see a recursive
3342           invocation <command>gmake Foo.mp&lowbar;o way=mp</command>,
3343           and <emphasis>in this recursive invocation the pattern rule
3344           for compiling a Haskell file into a <filename>.o</filename>
3345           file will match</emphasis>.  The key pattern rules (in
3346           <filename>suffix.mk</filename>) look like this:
3347
3348 <programlisting>%.$(way_)o : %.lhs
3349       $(HC) $(HC_OPTS) $&#60; -o $@</programlisting>
3350
3351           Neat, eh?</para>
3352         </listitem>
3353
3354         <listitem>
3355           <para>You can invoke <command>make</command> with a
3356           particular <literal>way</literal> setting yourself, in order
3357           to build files related to a particular
3358           <literal>way</literal> in the current directory.  eg.
3359
3360 <screen>$ make way=p</screen>
3361
3362           will build files for the profiling way only in the current
3363           directory. </para>
3364         </listitem>
3365       </itemizedlist>
3366     </sect2>
3367
3368     <sect2>
3369       <title>When the canned rule isn't right</title>
3370
3371       <para>Sometimes the canned rule just doesn't do the right thing.
3372       For example, in the <literal>nofib</literal> suite we want the
3373       link step to print out timing information.  The thing to do here
3374       is <emphasis>not</emphasis> to define
3375       <constant>HS&lowbar;PROG</constant> or
3376       <constant>C&lowbar;PROG</constant>, and instead define a special
3377       purpose rule in your own <filename>Makefile</filename>.  By
3378       using different variable names you will avoid the canned rules
3379       being included, and conflicting with yours.</para>
3380     </sect2>
3381   </sect1>
3382
3383   <sect1 id="building-docs">
3384     <title>Building the documentation</title>
3385
3386     <sect2 id="pre-supposed-doc-tools">
3387       <title>Tools for building the Documentation</title>
3388
3389       <para>The following additional tools are required if you want to
3390       format the documentation that comes with the
3391       <literal>fptools</literal> projects:</para>
3392       
3393       <variablelist>
3394         <varlistentry>
3395           <term>DocBook
3396             <indexterm><primary>pre-supposed: DocBook</primary></indexterm>
3397             <indexterm><primary>DocBook, pre-supposed</primary></indexterm>
3398           </term>
3399           <listitem>
3400             <para>Much of our documentation is written in DocBook XML, instructions
3401             on installing and configuring the DocBook tools are below.</para>
3402           </listitem>
3403         </varlistentry>
3404
3405         <varlistentry>
3406           <term>TeX
3407             <indexterm><primary>pre-supposed: TeX</primary></indexterm>
3408             <indexterm><primary>TeX, pre-supposed</primary></indexterm>
3409           </term>
3410           <listitem>
3411             <para>A decent TeX distribution is required if you want to
3412             produce printable documentation.  We recomment teTeX,
3413             which includes just about everything you need.</para>
3414           </listitem>
3415         </varlistentry>
3416
3417         <varlistentry>
3418           <term>Haddock
3419             <indexterm><primary>Haddock</primary></indexterm>
3420           </term>
3421           <listitem>
3422             <para>Haddock is a Haskell documentation tool that we use
3423             for automatically generating documentation from the
3424             library source code.  It is an <literal>fptools</literal>
3425             project in itself.  To build documentation for the
3426             libraries (<literal>fptools/libraries</literal>) you
3427             should check out and build Haddock in
3428             <literal>fptools/haddock</literal>.  Haddock requires GHC
3429             to build.</para>
3430           </listitem>
3431         </varlistentry>
3432       </variablelist>
3433     </sect2>
3434
3435     <sect2>
3436       <title>Installing the DocBook tools</title>
3437
3438       <sect3>
3439         <title>Installing the DocBook tools on Linux</title>
3440
3441         <para>If you're on a recent RedHat (7.0+) or SuSE (8.1+) system,
3442         you probably have working DocBook tools already installed. The
3443         configure script should detect your setup and you're away.</para>
3444
3445         <para>If you don't have DocBook tools installed, and you are
3446         using a system that can handle RPM packages, you can use <ulink
3447         url="http://rpmfind.net/">Rpmfind.net</ulink> to find suitable
3448         packages for your system. Search for the packages
3449         <literal>docbook-dtd</literal>,
3450         <literal>docbook-xsl-stylesheets</literal>,
3451         <literal>libxslt</literal>,
3452         <literal>libxml2</literal>,
3453         <literal>fop</literal>,
3454         <literal>xmltex</literal>, and
3455         <literal>dvips</literal>.</para>
3456       </sect3>
3457     
3458       <sect3>
3459         <title>Installing DocBook on FreeBSD</title>
3460
3461         <para>On FreeBSD systems, the easiest way to get DocBook up
3462         and running is to install it from the ports tree or a
3463         pre-compiled package (packages are available from your local
3464         FreeBSD mirror site).</para>
3465
3466         <para>To use the ports tree, do this:
3467 <screen>$ cd /usr/ports/textproc/docproj
3468 $ make install</screen>
3469         This installs the FreeBSD documentation project tools, which
3470         includes everything needed to format the GHC
3471         documentation.</para>
3472       </sect3>
3473
3474       <sect3>
3475         <title>Installing from binaries on Windows</title>
3476         
3477         <para>Probably the fastest route to a working DocBook environment on
3478         Windows is to install <ulink url="http://www.cygwin.com/">Cygwin</ulink>
3479         with the complete <literal>Doc</literal> category. If you are using
3480         <ulink url="http://www.mingw.org/">MinGW</ulink> for compilation, you
3481         have to help <command>configure</command> a little bit: Set the
3482         environment variables <envar>XmllintCmd</envar> and
3483         <envar>XsltprocCmd</envar> to the paths of the Cygwin executables
3484         <command>xmllint</command> and <command>xsltproc</command>,
3485         respectively, and set <envar>fp_cv_dir_docbook_xsl</envar> to the path
3486         of the directory where the XSL stylesheets are installed,
3487         e.g. <filename>c:/cygwin/usr/share/docbook-xsl</filename>.
3488         </para>
3489
3490         <para>If you want to build HTML Help, you have to install the
3491         <ulink url="http://msdn.microsoft.com/library/default.asp?url=/library/en-us/htmlhelp/html/hworiHTMLHelpStartPage.asp">HTML Help SDK</ulink>,
3492         too, and make sure that <command>hhc</command> is in your <envar>PATH</envar>.</para>
3493       </sect3>
3494
3495     </sect2>
3496
3497     <sect2>
3498       <title>Configuring the DocBook tools</title>
3499
3500       <para>Once the DocBook tools are installed, the configure script
3501       will detect them and set up the build system accordingly. If you
3502       have a system that isn't supported, let us know, and we'll try
3503       to help.</para>
3504     </sect2>
3505
3506     <sect2>
3507       <title>Building the documentation</title>
3508
3509       <para>To build documentation in a certain format, you can
3510       say, for example,</para>
3511
3512 <screen>$ make html</screen>
3513
3514       <para>to build HTML documentation below the current directory.
3515       The available formats are: <literal>dvi</literal>,
3516       <literal>ps</literal>, <literal>pdf</literal>,
3517       <literal>html</literal>, and <literal>rtf</literal>.  Note that
3518       not all documentation can be built in all of these formats: HTML
3519       documentation is generally supported everywhere, and DocBook
3520       documentation might support the other formats (depending on what
3521       other tools you have installed).</para>
3522
3523       <para>All of these targets are recursive; that is, saying
3524       <literal>make html</literal> will make HTML docs for all the
3525       documents recursively below the current directory.</para>
3526
3527       <para>Because there are many different formats that the DocBook
3528       documentation can be generated in, you have to select which ones
3529       you want by setting the <literal>XMLDocWays</literal> variable
3530       to a list of them.  For example, in
3531       <filename>build.mk</filename> you might have a line:</para>
3532
3533 <screen>XMLDocWays = html ps</screen>
3534
3535       <para>This will cause the documentation to be built in the requested
3536       formats as part of the main build (the default is not to build
3537       any documentation at all).</para>
3538     </sect2>
3539
3540     <sect2>
3541       <title>Installing the documentation</title>
3542
3543       <para>To install the documentation, use:</para>
3544
3545 <screen>$ make install-docs</screen>
3546
3547       <para>This will install the documentation into
3548       <literal>$(datadir)</literal> (which defaults to
3549       <literal>$(prefix)/share</literal>).  The exception is HTML
3550       documentation, which goes into
3551       <literal>$(datadir)/html</literal>, to keep things tidy.</para>
3552
3553       <para>Note that unless you set <literal>$(XMLDocWays)</literal>
3554       to a list of formats, the <literal>install-docs</literal> target
3555       won't do anything for DocBook XML documentation.</para>
3556     </sect2>
3557
3558   </sect1>
3559     
3560
3561   <sect1 id="sec-porting-ghc">
3562     <title>Porting GHC</title>
3563
3564     <para>This section describes how to port GHC to a currenly
3565     unsupported platform.  There are two distinct
3566     possibilities:</para>
3567
3568     <itemizedlist>
3569       <listitem>
3570         <para>The hardware architecture for your system is already
3571         supported by GHC, but you're running an OS that isn't
3572         supported (or perhaps has been supported in the past, but
3573         currently isn't).  This is the easiest type of porting job,
3574         but it still requires some careful bootstrapping.  Proceed to
3575         <xref linkend="sec-booting-from-hc"/>.</para>
3576       </listitem>
3577       
3578       <listitem>
3579         <para>Your system's hardware architecture isn't supported by
3580         GHC.  This will be a more difficult port (though by comparison
3581         perhaps not as difficult as porting gcc).  Proceed to <xref
3582         linkend="unregisterised-porting"/>.</para>
3583       </listitem>
3584     </itemizedlist>
3585     
3586     <sect2 id="sec-booting-from-hc">
3587       <title>Booting/porting from C (<filename>.hc</filename>) files</title>
3588
3589       <indexterm><primary>building GHC from .hc files</primary></indexterm>
3590       <indexterm><primary>booting GHC from .hc files</primary></indexterm>
3591       <indexterm><primary>porting GHC</primary></indexterm>
3592
3593       <para>Bootstrapping GHC on a system without GHC already
3594       installed is achieved by taking the intermediate C files (known
3595       as HC files) from another GHC compilation, compiling them using gcc to
3596         get a working GHC.</para>
3597
3598       <para><emphasis>NOTE: GHC versions 5.xx were hard to bootstrap
3599       from C.  We recommend using GHC 6.0.1 or
3600       later.</emphasis></para>
3601
3602       <para>HC files are platform-dependent, so you have to get a set
3603         that were generated on <emphasis>the same platform</emphasis>.  There
3604         may be some supplied on the GHC download page, otherwise you'll have to
3605         compile some up yourself, or start from
3606         <emphasis>unregisterised</emphasis> HC files - see <xref
3607           linkend="unregisterised-porting"/>.</para>
3608
3609       <para>The following steps should result in a working GHC build
3610       with full libraries:</para>
3611
3612       <itemizedlist>
3613         <listitem>
3614           <para>Unpack the HC files on top of a fresh source tree
3615           (make sure the source tree version matches the version of
3616           the HC files <emphasis>exactly</emphasis>!).  This will
3617           place matching <filename>.hc</filename> files next to the
3618           corresponding Haskell source (<filename>.hs</filename> or
3619           <filename>.lhs</filename>) in the compiler subdirectory
3620           <filename>ghc/compiler</filename> and in the libraries
3621           (subdirectories of 
3622           <literal>libraries</literal>).</para>
3623         </listitem>
3624
3625         <listitem>
3626           <para>The actual build process is fully automated by the
3627           <filename>hc-build</filename> script located in the
3628           <filename>distrib</filename> directory.  If you eventually
3629           want to install GHC into the directory
3630           <replaceable>dir</replaceable>, the following
3631           command will execute the whole build process (it won't
3632           install yet):</para>
3633
3634 <screen>$ distrib/hc-build --prefix=<replaceable>dir</replaceable></screen>
3635 <indexterm><primary>--hc-build</primary></indexterm>
3636
3637           <para>By default, the installation directory is
3638           <filename>/usr/local</filename>.  If that is what you want,
3639           you may omit the argument to <filename>hc-build</filename>.
3640           Generally, any option given to <filename>hc-build</filename>
3641           is passed through to the configuration script
3642           <filename>configure</filename>.  If
3643           <filename>hc-build</filename> successfully completes the
3644           build process, you can install the resulting system, as
3645           normal, with</para>
3646
3647 <screen>$ make install</screen>
3648         </listitem>
3649       </itemizedlist>
3650     </sect2>
3651
3652     <sect2 id="unregisterised-porting">
3653       <title>Porting GHC to a new architecture</title>
3654       
3655       <para>The first step in porting to a new architecture is to get
3656       an <firstterm>unregisterised</firstterm> build working.  An
3657       unregisterised build is one that compiles via vanilla C only.
3658       By contrast, a registerised build uses the following
3659       architecture-specific hacks for speed:</para>
3660
3661       <itemizedlist>
3662         <listitem>
3663           <para>Global register variables: certain abstract machine
3664           <quote>registers</quote> are mapped to real machine
3665           registers, depending on how many machine registers are
3666           available (see
3667           <filename>ghc/includes/MachRegs.h</filename>).</para>
3668         </listitem>
3669
3670         <listitem>
3671           <para>Assembly-mangling: when compiling via C, we feed the
3672           assembly generated by gcc though a Perl script known as the
3673           <firstterm>mangler</firstterm> (see
3674           <filename>ghc/driver/mangler/ghc-asm.lprl</filename>).  The
3675           mangler rearranges the assembly to support tail-calls and
3676           various other optimisations.</para>
3677         </listitem>
3678       </itemizedlist>
3679
3680       <para>In an unregisterised build, neither of these hacks are
3681       used &mdash; the idea is that the C code generated by the
3682       compiler should compile using gcc only.  The lack of these
3683       optimisations costs about a factor of two in performance, but
3684       since unregisterised compilation is usually just a step on the
3685       way to a full registerised port, we don't mind too much.</para>
3686
3687       <para>Notes on GHC portability in general: we've tried to stick
3688       to writing portable code in most parts of the system, so it
3689       should compile on any POSIXish system with gcc, but in our
3690       experience most systems differ from the standards in one way or
3691       another.  Deal with any problems as they arise - if you get
3692       stuck, ask the experts on
3693       <email>glasgow-haskell-users@haskell.org</email>.</para>
3694         
3695       <para>Lots of useful information about the innards of GHC is
3696       available in the <ulink
3697       url="http://www.cse.unsw.edu.au/~chak/haskell/ghc/comm/">GHC
3698       Commentary</ulink>, which might be helpful if you run into some
3699       code which needs tweaking for your system.</para>
3700
3701       <sect3>
3702         <title>Cross-compiling to produce an unregisterised GHC</title>
3703
3704         <para>NOTE!  These instructions apply to GHC 6.4 and (hopefully)
3705           later.  If you need instructions for an earlier version of GHC, try
3706           to get hold of the version of this document that was current at the
3707           time.  It should be available from the appropriate download page on
3708           the <ulink
3709             url="http://www.haskell.org/ghc/">GHC&nbsp;homepage</ulink>.</para>  
3710
3711         <para>In this section, we explain how to bootstrap GHC on a
3712         new platform, using unregisterised intermediate C files.  We
3713         haven't put a great deal of effort into automating this
3714         process, for two reasons: it is done very rarely, and the
3715         process usually requires human intervention to cope with minor
3716         porting issues anyway.</para>
3717
3718         <para>The following step-by-step instructions should result in
3719         a fully working, albeit unregisterised, GHC.  Firstly, you
3720         need a machine that already has a working GHC (we'll call this
3721         the <firstterm>host</firstterm> machine), in order to
3722         cross-compile the intermediate C files that we will use to
3723         bootstrap the compiler on the <firstterm>target</firstterm>
3724         machine.</para>
3725
3726         <itemizedlist>
3727           <listitem>
3728             <para>On the target machine:</para>
3729
3730           <itemizedlist>
3731             <listitem>
3732               <para>Unpack a source tree (preferably a released
3733               version).  We will call the path to the root of this
3734               tree <replaceable>T</replaceable>.</para>
3735             </listitem>
3736
3737             <listitem>
3738 <screen>$ cd <replaceable>T</replaceable>
3739 $ ./configure --enable-hc-boot --enable-hc-boot-unregisterised</screen>
3740
3741               <para>You might need to update
3742               <filename>configure.in</filename> to recognise the new
3743               architecture, and re-generate
3744               <filename>configure</filename> with
3745               <literal>autoreconf</literal>.</para>
3746             </listitem>
3747   
3748             <listitem>
3749 <screen>$ cd <replaceable>T</replaceable>/ghc/includes
3750 $ make</screen>
3751             </listitem>
3752           </itemizedlist>
3753           </listitem>
3754
3755           <listitem>
3756             <para>On the host machine:</para>
3757               
3758           <itemizedlist>
3759             <listitem>
3760               <para>Unpack a source tree (same released version).  Call
3761               this directory <replaceable>H</replaceable>.</para>
3762             </listitem>
3763  
3764             <listitem>
3765 <screen>$ cd <replaceable>H</replaceable>
3766 $ ./configure</screen>
3767             </listitem>
3768
3769             <listitem>
3770               <para>Create
3771               <filename><replaceable>H</replaceable>/mk/build.mk</filename>,
3772               with the following contents:</para>
3773
3774 <programlisting>GhcUnregisterised = YES
3775 GhcLibHcOpts = -O -fvia-C -keep-hc-files
3776 GhcRtsHcOpts = -keep-hc-files
3777 GhcLibWays =
3778 SplitObjs = NO
3779 GhcWithNativeCodeGen = NO
3780 GhcWithInterpreter = NO
3781 GhcStage1HcOpts = -O
3782 GhcStage2HcOpts = -O -fvia-C -keep-hc-files
3783 SRC_HC_OPTS += -H32m
3784 GhcBootLibs = YES</programlisting>
3785             </listitem>
3786
3787             <listitem>
3788               <para>Edit
3789               <filename><replaceable>H</replaceable>/mk/config.mk</filename>:</para>
3790               <itemizedlist>
3791                 <listitem>
3792                   <para>change <literal>TARGETPLATFORM</literal>
3793                   appropriately, and set the variables involving
3794                   <literal>TARGET</literal> to the correct values for
3795                   the target platform.  This step is necessary because
3796                   currently <literal>configure</literal> doesn't cope
3797                   with specifying different values for the
3798                   <literal>--host</literal> and
3799                   <literal>--target</literal> flags.</para>
3800                 </listitem>
3801                 <listitem>
3802                   <para>copy <literal>LeadingUnderscore</literal>
3803                   setting from target.</para>
3804                 </listitem>
3805               </itemizedlist>
3806             </listitem>
3807
3808             <listitem>
3809               <para>Copy
3810               <filename><replaceable>T</replaceable>/ghc/includes/ghcautoconf.h</filename>, <filename><replaceable>T</replaceable>/ghc/includes/DerivedConstants.h</filename>, and <filename><replaceable>T</replaceable>/ghc/includes/GHCConstants.h</filename>
3811               to
3812               <filename><replaceable>H</replaceable>/ghc/includes</filename>.
3813               Note that we are building on the host machine, using the
3814               target machine's configuration files.  This
3815               is so that the intermediate C files generated here will
3816               be suitable for compiling on the target system.</para>
3817             </listitem>
3818
3819               <listitem>
3820                 <para>Touch the generated configuration files, just to make
3821                 sure they don't get replaced during the build:</para>
3822 <screen>$ cd <filename><replaceable>H</replaceable></filename>/ghc/includes
3823 $ touch ghcautoconf.h DerivedConstants.h GHCConstants.h mkDerivedConstants.c
3824 $ touch mkDerivedConstantsHdr mkDerivedConstants.o mkGHCConstants mkGHCConstants.o</screen>
3825
3826                 <para>Note: it has been reported that these files still get
3827                   overwritten during the next stage.  We have installed a fix
3828                   for this in GHC 6.4.2, but if you are building a version
3829                   before that you need to watch out for these files getting
3830                   overwritte by the <literal>Makefile</literal> in
3831                   <literal>ghc/includes</literal>.  If your system supports
3832                   it, you might be able to prevent it by making them
3833                   immutable:</para>
3834 <screen>$ chflags uchg  ghc/includes/{ghcautoconf.h,DerivedConstants.h,GHCConstants.h}</screen>
3835               </listitem>
3836
3837             <listitem>
3838                 <para>Now build the compiler:</para>
3839 <screen>$ cd <replaceable>H</replaceable>/glafp-utils &amp;&amp; make boot &amp;&amp; make
3840 $ cd <replaceable>H</replaceable>/ghc &amp;&amp; make boot &amp;&amp; make</screen>
3841               <para>Don't worry if the build falls over in the RTS, we
3842               don't need the RTS yet.</para>
3843             </listitem>
3844
3845             <listitem>
3846 <screen>$ cd <replaceable>H</replaceable>/libraries
3847 $ make boot &amp;&amp; make</screen>
3848             </listitem>
3849
3850             <listitem>
3851 <screen>$ cd <replaceable>H</replaceable>/ghc/compiler
3852 $ make boot stage=2 &amp;&amp; make stage=2</screen>
3853             </listitem>
3854
3855             <listitem>
3856 <screen>$ cd <replaceable>H</replaceable>/ghc/lib/compat
3857 $ make clean
3858 $ rm .depend
3859 $ make boot UseStage1=YES
3860 $ make -k UseStage1=YES EXTRA_HC_OPTS='-O -fvia-C -keep-hc-files'
3861 $ cd <replaceable>H</replaceable>/ghc/utils
3862 $ make clean
3863 $ make -k UseStage1=YES EXTRA_HC_OPTS='-O -fvia-C -keep-hc-files'</screen>
3864             </listitem>
3865             
3866             <listitem>
3867 <screen>$ cd <replaceable>H</replaceable>
3868 $ make hc-file-bundle Project=Ghc</screen>
3869             </listitem>
3870
3871             <listitem>
3872               <para>copy
3873               <filename><replaceable>H</replaceable>/*-hc.tar.gz</filename>
3874               to <filename><replaceable>T</replaceable>/..</filename>.</para>
3875             </listitem>
3876           </itemizedlist>
3877           </listitem>
3878
3879           <listitem>
3880             <para>On the target machine:</para>
3881
3882             <para>At this stage we simply need to bootstrap a compiler
3883             from the intermediate C files we generated above.  The
3884             process of bootstrapping from C files is automated by the
3885             script in <literal>distrib/hc-build</literal>, and is
3886             described in <xref linkend="sec-booting-from-hc"/>.</para>
3887
3888 <screen>$ ./distrib/hc-build --enable-hc-boot-unregisterised</screen>
3889
3890             <para>However, since this is a bootstrap on a new machine,
3891             the automated process might not run to completion the
3892             first time.  For that reason, you might want to treat the
3893             <literal>hc-build</literal> script as a list of
3894             instructions to follow, rather than as a fully automated
3895             script.  This way you'll be able to restart the process
3896             part-way through if you need to fix anything on the
3897             way.</para>
3898
3899             <para>Don't bother with running
3900             <literal>make&nbsp;install</literal> in the newly
3901             bootstrapped tree; just use the compiler in that tree to
3902             build a fresh compiler from scratch, this time without
3903             booting from C files.  Before doing this, you might want
3904             to check that the bootstrapped compiler is generating
3905             working binaries:</para>
3906
3907 <screen>$ cat >hello.hs
3908 main = putStrLn "Hello World!\n"
3909 ^D
3910 $ <replaceable>T</replaceable>/ghc/compiler/ghc-inplace hello.hs -o hello
3911 $ ./hello
3912 Hello World!</screen>
3913
3914             <para>Once you have the unregisterised compiler up and
3915             running, you can use it to start a registerised port.  The
3916             following sections describe the various parts of the
3917             system that will need architecture-specific tweaks in
3918             order to get a registerised build going.</para>
3919
3920           </listitem>
3921         </itemizedlist>
3922       </sect3>
3923
3924       <sect3>
3925         <title>Porting the RTS</title>
3926         
3927         <para>The following files need architecture-specific code for a
3928         registerised build:</para>
3929
3930         <variablelist>
3931           <varlistentry>
3932             <term><filename>ghc/includes/MachRegs.h</filename>
3933             <indexterm><primary><filename>MachRegs.h</filename></primary></indexterm>
3934             </term>
3935             <listitem>
3936               <para>Defines the STG-register to machine-register
3937               mapping.  You need to know your platform's C calling
3938               convention, and which registers are generally available
3939               for mapping to global register variables.  There are
3940               plenty of useful comments in this file.</para>
3941             </listitem>
3942           </varlistentry>
3943           <varlistentry>
3944             <term><filename>ghc/includes/TailCalls.h</filename>
3945             <indexterm><primary><filename>TailCalls.h</filename></primary></indexterm>
3946             </term>
3947             <listitem>
3948               <para>Macros that cooperate with the mangler (see <xref
3949               linkend="sec-mangler"/>) to make proper tail-calls
3950               work.</para>
3951             </listitem>
3952           </varlistentry>
3953           <varlistentry>
3954             <term><filename>ghc/rts/Adjustor.c</filename>
3955             <indexterm><primary><filename>Adjustor.c</filename></primary></indexterm>
3956             </term>
3957             <listitem>
3958               <para>Support for
3959               <literal>foreign&nbsp;import&nbsp;"wrapper"</literal>
3960               (aka
3961               <literal>foreign&nbsp;export&nbsp;dynamic</literal>).
3962               Not essential for getting GHC bootstrapped, so this file
3963               can be deferred until later if necessary.</para>
3964             </listitem>
3965           </varlistentry>
3966           <varlistentry>
3967             <term><filename>ghc/rts/StgCRun.c</filename>
3968             <indexterm><primary><filename>StgCRun.c</filename></primary></indexterm>
3969             </term>
3970             <listitem>
3971               <para>The little assembly layer between the C world and
3972               the Haskell world.  See the comments and code for the
3973               other architectures in this file for pointers.</para>
3974             </listitem>
3975           </varlistentry>
3976           <varlistentry>
3977             <term><filename>ghc/rts/MBlock.h</filename>
3978               <indexterm><primary><filename>MBlock.h</filename></primary></indexterm>
3979             </term>
3980             <term><filename>ghc/rts/MBlock.c</filename>
3981               <indexterm><primary><filename>MBlock.c</filename></primary></indexterm>
3982             </term>
3983             <listitem>
3984               <para>These files are really OS-specific rather than
3985               architecture-specific.  In <filename>MBlock.h</filename>
3986               is specified the absolute location at which the RTS
3987               should try to allocate memory on your platform (try to
3988               find an area which doesn't conflict with code or dynamic
3989               libraries).  In <filename>Mblock.c</filename> you might
3990               need to tweak the call to <literal>mmap()</literal> for
3991               your OS.</para>
3992             </listitem>
3993           </varlistentry>
3994         </variablelist>
3995       </sect3>
3996
3997       <sect3 id="sec-mangler">
3998         <title>The mangler</title>
3999         
4000         <para>The mangler is an evil Perl-script
4001         (<filename>ghc/driver/mangler/ghc-asm.lprl</filename>) that
4002         rearranges the assembly code output from gcc to do two main
4003         things:</para>
4004
4005         <itemizedlist>
4006           <listitem>
4007             <para>Remove function prologues and epilogues, and all
4008             movement of the C stack pointer.  This is to support
4009             tail-calls: every code block in Haskell code ends in an
4010             explicit jump, so we don't want the C-stack overflowing
4011             while we're jumping around between code blocks.</para>
4012           </listitem>
4013           <listitem>
4014             <para>Move the <firstterm>info table</firstterm> for a
4015             closure next to the entry code for that closure.  In
4016             unregisterised code, info tables contain a pointer to the
4017             entry code, but in registerised compilation we arrange
4018             that the info table is shoved right up against the entry
4019             code, and addressed backwards from the entry code pointer
4020             (this saves a word in the info table and an extra
4021             indirection when jumping to the closure entry
4022             code).</para>
4023           </listitem>
4024         </itemizedlist>
4025
4026         <para>The mangler is abstracted to a certain extent over some
4027         architecture-specific things such as the particular assembler
4028         directives used to herald symbols.  Take a look at the
4029         definitions for other architectures and use these as a
4030         starting point.</para>
4031       </sect3>
4032
4033       <sect3>
4034         <title>The splitter</title>
4035
4036         <para>The splitter is another evil Perl script
4037         (<filename>ghc/driver/split/ghc-split.lprl</filename>).  It
4038         cooperates with the mangler to support object splitting.
4039         Object splitting is what happens when the
4040         <option>-split-objs</option> option is passed to GHC: the
4041         object file is split into many smaller objects.  This feature
4042         is used when building libraries, so that a program statically
4043         linked against the library will pull in less of the
4044         library.</para>
4045
4046         <para>The splitter has some platform-specific stuff; take a
4047         look and tweak it for your system.</para>
4048       </sect3>
4049
4050       <sect3>
4051         <title>The native code generator</title>
4052
4053         <para>The native code generator isn't essential to getting a
4054         registerised build going, but it's a desirable thing to have
4055         because it can cut compilation times in half.  The native code
4056         generator is described in some detail in the <ulink
4057         url="http://www.cse.unsw.edu.au/~chak/haskell/ghc/comm/">GHC
4058         commentary</ulink>.</para>
4059       </sect3>
4060
4061       <sect3>
4062         <title>GHCi</title>
4063
4064         <para>To support GHCi, you need to port the dynamic linker
4065         (<filename>fptools/ghc/rts/Linker.c</filename>).  The linker
4066         currently supports the ELF and PEi386 object file formats - if
4067         your platform uses one of these then things will be
4068         significantly easier.  The majority of Unix platforms use the
4069         ELF format these days.  Even so, there are some
4070         machine-specific parts of the ELF linker: for example, the
4071         code for resolving particular relocation types is
4072         machine-specific, so some porting of this code to your
4073         architecture will probaly be necessary.</para>
4074         
4075         <para>If your system uses a different object file format, then
4076         you have to write a linker &mdash; good luck!</para>
4077       </sect3>
4078     </sect2>
4079
4080   </sect1>
4081
4082 <sect1 id="sec-build-pitfalls">
4083 <title>Known pitfalls in building Glasgow Haskell
4084
4085 <indexterm><primary>problems, building</primary></indexterm>
4086 <indexterm><primary>pitfalls, in building</primary></indexterm>
4087 <indexterm><primary>building pitfalls</primary></indexterm></title>
4088
4089 <para>
4090 WARNINGS about pitfalls and known &ldquo;problems&rdquo;:
4091 </para>
4092
4093 <para>
4094
4095 <orderedlist>
4096 <listitem>
4097
4098 <para>
4099 One difficulty that comes up from time to time is running out of space
4100 in <literal>TMPDIR</literal>.  (It is impossible for the configuration stuff to
4101 compensate for the vagaries of different sysadmin approaches to temp
4102 space.)
4103 <indexterm><primary>tmp, running out of space in</primary></indexterm>
4104
4105 The quickest way around it is <command>setenv TMPDIR /usr/tmp</command><indexterm><primary>TMPDIR</primary></indexterm> or
4106 even <command>setenv TMPDIR .</command> (or the equivalent incantation with your shell
4107 of choice).
4108
4109 The best way around it is to say
4110
4111 <programlisting>export TMPDIR=&#60;dir&#62;</programlisting>
4112
4113 in your <filename>build.mk</filename> file.
4114 Then GHC and the other <literal>fptools</literal> programs will use the appropriate directory
4115 in all cases.
4116
4117
4118 </para>
4119 </listitem>
4120 <listitem>
4121
4122 <para>
4123 In compiling some support-code bits, e.g., in <filename>ghc/rts/gmp</filename> and even
4124 in <filename>ghc/lib</filename>, you may get a few C-compiler warnings.  We think these
4125 are OK.
4126
4127 </para>
4128 </listitem>
4129 <listitem>
4130
4131 <para>
4132 When compiling via C, you'll sometimes get &ldquo;warning: assignment from
4133 incompatible pointer type&rdquo; out of GCC.  Harmless.
4134
4135 </para>
4136 </listitem>
4137 <listitem>
4138
4139 <para>
4140 Similarly, <command>ar</command>chiving warning messages like the following are not
4141 a problem:
4142
4143 <screen>ar: filename GlaIOMonad__1_2s.o truncated to GlaIOMonad_
4144 ar: filename GlaIOMonad__2_2s.o truncated to GlaIOMonad_
4145 ...</screen>
4146
4147
4148 </para>
4149 </listitem>
4150 <listitem>
4151
4152 <para>
4153  In compiling the compiler proper (in <filename>compiler/</filename>), you <emphasis>may</emphasis>
4154 get an &ldquo;Out of heap space&rdquo; error message.  These can vary with the
4155 vagaries of different systems, it seems.  The solution is simple:
4156
4157
4158 <itemizedlist>
4159 <listitem>
4160
4161 <para>
4162  If you're compiling with GHC 4.00 or later, then the
4163 <emphasis>maximum</emphasis> heap size must have been reached.  This
4164 is somewhat unlikely, since the maximum is set to 64M by default.
4165 Anyway, you can raise it with the
4166 <option>-optCrts-M&lt;size&gt;</option> flag (add this flag to
4167 <constant>&lt;module&gt;&lowbar;HC&lowbar;OPTS</constant>
4168 <command>make</command> variable in the appropriate
4169 <filename>Makefile</filename>).
4170
4171 </para>
4172 </listitem>
4173 <listitem>
4174
4175 <para>
4176  For GHC &#60; 4.00, add a suitable <option>-H</option> flag to the <filename>Makefile</filename>, as
4177 above.
4178
4179 </para>
4180 </listitem>
4181
4182 </itemizedlist>
4183
4184
4185 and try again: <command>gmake</command>.  (see <xref linkend="sec-suffix"/> for information about
4186 <constant>&lt;module&gt;&lowbar;HC&lowbar;OPTS</constant>.)
4187
4188 Alternatively, just cut to the chase:
4189
4190 <screen>$ cd ghc/compiler
4191 $ make EXTRA_HC_OPTS=-optCrts-M128M</screen>
4192
4193
4194 </para>
4195 </listitem>
4196 <listitem>
4197
4198 <para>
4199 If you try to compile some Haskell, and you get errors from GCC about
4200 lots of things from <filename>/usr/include/math.h</filename>, then your GCC was
4201 mis-installed.  <command>fixincludes</command> wasn't run when it should've been.
4202
4203 As <command>fixincludes</command> is now automagically run as part of GCC installation,
4204 this bug also suggests that you have an old GCC.
4205
4206
4207 </para>
4208 </listitem>
4209 <listitem>
4210
4211 <para>
4212 You <emphasis>may</emphasis> need to re-<command>ranlib</command><indexterm><primary>ranlib</primary></indexterm> your libraries (on Sun4s).
4213
4214
4215 <screen>$ cd $(libdir)/ghc-x.xx/sparc-sun-sunos4
4216 $ foreach i ( `find . -name '*.a' -print` ) # or other-shell equiv...
4217 ?    ranlib $i
4218 ?    # or, on some machines: ar s $i
4219 ? end</screen>
4220
4221
4222 We'd be interested to know if this is still necessary.
4223
4224
4225 </para>
4226 </listitem>
4227 <listitem>
4228
4229 <para>
4230 GHC's sources go through <command>cpp</command> before being compiled, and <command>cpp</command> varies
4231 a bit from one Unix to another.  One particular gotcha is macro calls
4232 like this:
4233
4234
4235 <programlisting>SLIT("Hello, world")</programlisting>
4236
4237
4238 Some <command>cpp</command>s treat the comma inside the string as separating two macro
4239 arguments, so you get
4240
4241
4242 <screen>:731: macro `SLIT' used with too many (2) args</screen>
4243
4244
4245 Alas, <command>cpp</command> doesn't tell you the offending file!
4246
4247 Workaround: don't put weird things in string args to <command>cpp</command> macros.
4248 </para>
4249 </listitem>
4250
4251 </orderedlist>
4252
4253 </para>
4254
4255 </sect1>
4256
4257
4258 <sect1 id="platforms"><title>Platforms, scripts, and file names</title>
4259 <para>
4260 GHC is designed both to be built, and to run, on both Unix and Windows.  This flexibility
4261 gives rise to a good deal of brain-bending detail, which we have tried to collect in this chapter.
4262 </para>
4263
4264 <sect2 id="cygwin-and-mingw"><title>Windows platforms: Cygwin, MSYS, and MinGW</title>
4265
4266 <para> The build system is built around Unix-y makefiles.  Because it's not native,
4267 the Windows situation for building GHC is particularly confusing.  This section
4268 tries to clarify, and to establish terminology.</para>
4269
4270 <sect3 id="ghc-mingw"><title>MinGW</title>
4271
4272 <para> <ulink url="http://www.mingw.org">MinGW (Minimalist GNU for Windows)</ulink> 
4273 is a collection of header
4274 files and import libraries that allow one to use <command>gcc</command> and produce
4275 native Win32 programs that do not rely on any third-party DLLs. The
4276 current set of tools include GNU Compiler Collection (<command>gcc</command>), GNU Binary
4277 Utilities (Binutils), GNU debugger (Gdb), GNU make, and a assorted
4278 other utilities. 
4279 </para>
4280
4281 <para> The down-side of MinGW is that the MinGW libraries do not support anything like the full
4282 Posix interface.  
4283 </para>
4284 </sect3>
4285
4286 <sect3 id="ghc-cygwin"><title>Cygwin and MSYS</title>
4287
4288 <para>You can't use the MinGW to <emphasis>build</emphasis> GHC, because MinGW doesn't have a shell,
4289 or the standard Unix commands such as <command>mv</command>, <command>rm</command>,
4290 <command>ls</command>, nor build-system stuff such as <command>make</command> and <command>cvs</command>.
4291 For that, there are two choices: <ulink url="http://www.cygwin.com">Cygwin</ulink> 
4292 and <ulink url="http://www.mingw.org/msys.shtml">MSYS</ulink>:
4293
4294 <itemizedlist>
4295 <listitem><para>
4296 Cygwin comes with compilation tools (<command>gcc</command>, <command>ld</command> and so on), which
4297 compile code that has access to all of Posix.  The price is that the executables must be 
4298 dynamically linked with the Cygwin DLL, so that <emphasis>you cannot run a Cywin-compiled program on a machine
4299 that doesn't have Cygwin</emphasis>.  Worse, Cygwin is a moving target.  The name of the main DLL, <literal>cygwin1.dll</literal>
4300 does not change, but the implementation certainly does.  Even the interfaces to functions
4301 it exports seem to change occasionally. </para>
4302 </listitem>
4303
4304 <listitem><para>
4305 MSYS is a fork of the Cygwin tree, so they
4306 are fundamentally similar.  However, MSYS is by design much smaller and simpler.  Access to the file system goes
4307 through fewer layers, so MSYS is quite a bit faster too.
4308 </para>
4309
4310 <para>Furthermore, MSYS provides no compilation tools; it relies instead on the MinGW tools. These
4311 compile binaries that run with no DLL support, on any Win32 system.
4312 However, MSYS does come with all the make-system tools, such as <command>make</command>, <command>autoconf</command>, 
4313 <command>cvs</command>, <command>ssh</command> etc.  To get these, you have to download the 
4314 MsysDTK (Developer Tool Kit) package, as well as the base MSYS package.
4315 </para>
4316 <para>MSYS does have a DLL, but it's only used by MSYS commands (<command>sh</command>, <command>rm</command>, 
4317 <command>ssh</command> and so on),
4318 not by programs compiled under MSYS.
4319 </para></listitem>
4320
4321 </itemizedlist>
4322
4323 </para>
4324 </sect3>
4325
4326 <sect3><title>Targeting MinGW</title>
4327
4328 <para>We want GHC to compile programs that work on any Win32 system.  Hence:
4329 <itemizedlist>
4330 <listitem><para>
4331 GHC does invoke a C compiler, assembler, linker and so on, but we ensure that it only
4332 invokes the MinGW tools, not the Cygwin ones.  That means that the programs GHC compiles
4333 will work on any system, but it also means that the programs GHC compiles do not have access
4334 to all of Posix.  In particular, they cannot import the (Haskell) Posix 
4335 library; they have to do
4336 their input output using standard Haskell I/O libraries, or native Win32 bindings.</para>
4337 <para> We will call a GHC that targets MinGW in this way <emphasis>GHC-mingw</emphasis>.</para>
4338 </listitem>
4339
4340 <listitem><para>
4341 To make the GHC distribution self-contained, the GHC distribution includes the MinGW <command>gcc</command>,
4342 <command>as</command>, <command>ld</command>, and a bunch of input/output libraries.  
4343 </para></listitem>
4344 </itemizedlist>
4345 So <emphasis>GHC targets MinGW</emphasis>, not Cygwin.
4346 It is in principle possible to build a version of GHC, <emphasis>GHC-cygwin</emphasis>, 
4347 that targets Cygwin instead.  The up-side of GHC-cygwin is
4348 that Haskell programs compiled by GHC-cygwin can import the (Haskell) Posix library.
4349 <emphasis>We do not support GHC-cygwin, however; it is beyond our resources.</emphasis>
4350 </para>
4351
4352 <para>While GHC <emphasis>targets</emphasis> MinGW, that says nothing about 
4353 how GHC is <emphasis>built</emphasis>.  We use both MSYS and Cygwin as build environments for
4354 GHC; both work fine, though MSYS is rather lighter weight.</para>
4355
4356 <para>In your build tree, you build a compiler called <command>ghc-inplace</command>.  It
4357 uses the <command>gcc</command> that you specify using the
4358 <option>--with-gcc</option> flag when you run
4359 <command>configure</command> (see below).
4360 The makefiles are careful to use <command>ghc-inplace</command> (not <command>gcc</command>)
4361 to compile any C files, so that it will in turn invoke the correct <command>gcc</command> rather that
4362 whatever one happens to be in your path.  However, the makefiles do use whatever <command>ld</command> 
4363 and <command>ar</command> happen to be in your path. This is a bit naughty, but (a) they are only
4364 used to glom together .o files into a bigger .o file, or a .a file, 
4365 so they don't ever get libraries (which would be bogus; they might be the wrong libraries), and (b)
4366 Cygwin and MinGW use the same .o file format.  So its ok.
4367 </para>
4368 </sect3>
4369
4370 <sect3><title> File names </title>
4371
4372 <para>Cygwin, MSYS, and the underlying Windows file system all understand file paths of form <literal>c:/tmp/foo</literal>.
4373 However:
4374 <itemizedlist>
4375 <listitem><para>
4376 MSYS programs understand <filename>/bin</filename>, <filename>/usr/bin</filename>, and map Windows's lettered drives as
4377 <filename>/c/tmp/foo</filename> etc.  The exact mount table is given in the doc subdirectory of the MSYS distribution.
4378 </para>
4379 <para> When it invokes a command, the MSYS shell sees whether the invoked binary lives in the MSYS <filename>/bin</filename>
4380 directory.  If so, it just invokes it.  If not, it assumes the program is no an MSYS program, and walks over the command-line
4381 arguments changing MSYS paths into native-compatible paths.  It does this inside sub-arguments and inside quotes. For example,
4382 if you invoke
4383 <programlisting>foogle -B/c/tmp/baz</programlisting>
4384 the MSYS shell will actually call <literal>foogle</literal> with argument <literal>-Bc:/tmp/baz</literal>.
4385 </para></listitem>
4386
4387 <listitem><para>
4388 Cygwin programs have a more complicated mount table, and map the lettered drives as <filename>/cygdrive/c/tmp/foo</filename>.
4389 </para>
4390 <para>The Cygwin shell does no argument processing when invoking non-Cygwin programs.
4391 </para></listitem>
4392 </itemizedlist>
4393 </para>
4394 </sect3>
4395
4396 <sect3><title>Crippled <command>ld</command></title>
4397
4398 <para>
4399 It turns out that on both Cygwin and MSYS, the <command>ld</command> has a
4400 limit of 32kbytes on its command line.  Especially when using split object
4401 files, the make system can emit calls to <command>ld</command> with thousands
4402 of files on it.  Then you may see something like this:
4403 <programlisting>
4404 (cd Graphics/Rendering/OpenGL/GL/QueryUtils_split &amp;&amp; /mingw/bin/ld -r -x -o ../QueryUtils.o *.o)
4405 /bin/sh: /mingw/bin/ld: Invalid argument
4406 </programlisting>
4407 The solution is either to switch off object file splitting (set
4408 <option>SplitObjs</option> to <literal>NO</literal> in your
4409 <filename>build.mk</filename>),
4410 or to make the module smaller.
4411 </para>
4412 </sect3>
4413
4414 <sect3><title>Host System vs Target System</title>
4415
4416 <para>
4417 In the source code you'll find various ifdefs looking like:
4418 <programlisting>#ifdef mingw32_HOST_OS
4419   ...blah blah...
4420 #endif</programlisting>
4421 and 
4422 <programlisting>#ifdef mingw32_TARGET_OS
4423   ...blah blah...
4424 #endif</programlisting>
4425 These macros are set by the configure script (via the file config.h).
4426 Which is which?  The criterion is this.  In the ifdefs in GHC's source code:
4427 <itemizedlist>
4428   <listitem>
4429     <para>The "host" system is the one on which GHC itself will be run.</para>
4430   </listitem>
4431   <listitem>
4432     <para>The "target" system is the one for which the program compiled by GHC will be run.</para>
4433   </listitem>
4434 </itemizedlist>
4435 For a stage-2 compiler, in which GHCi is available, the "host" and "target" systems must be the same.
4436 So then it doesn't really matter whether you use the HOST_OS or TARGET_OS cpp macros.
4437
4438 </para>
4439 </sect3>
4440
4441 </sect2>
4442
4443 <sect2><title>Wrapper scripts</title>
4444
4445 <para>
4446 Many programs, including GHC itself and hsc2hs, need to find associated binaries and libraries.
4447 For <emphasis>installed</emphasis> programs, the strategy depends on the platform.  We'll use
4448 GHC itself as an example:
4449 <itemizedlist>
4450   <listitem> <para>
4451   On Unix, the command <command>ghc</command> is a shell script, generated by adding installation
4452   paths to the front of the source file <filename>ghc.sh</filename>, 
4453   that invokes the real binary, passing "-B<emphasis>path</emphasis>" as an argument to tell <command>ghc</command>
4454   where to find its supporting files. 
4455   </para> </listitem>
4456
4457   <listitem> <para>
4458   On vanilla Windows, it turns out to be much harder to make reliable script to be run by the
4459   native Windows shell <command>cmd</command> (e.g. limits on the length
4460    of the command line).  So instead we invoke the GHC binary directly, with no -B flag.
4461   GHC uses the Windows <literal>getExecDir</literal> function to find where the executable is,
4462   and from that figures out where the supporting files are.
4463   </para> </listitem>
4464 </itemizedlist>
4465 (You can find the layout of GHC's supporting files in the
4466   section "Layout of installed files" of Section 2 of the GHC user guide.)
4467 </para>
4468 <para>
4469 Things work differently for <emphasis>in-place</emphasis> execution, where you want to
4470 execute a program that has just been built in a build tree. The difference is that the
4471 layout of the supporting files is different.
4472 In this case, whether on Windows or Unix, we always use a shell script. This works OK
4473 on Windows because the script is executed by MSYS or Cygwin, which don't have the
4474 shortcomings of the native Windows <command>cmd</command> shell.
4475 </para>
4476
4477 </sect2>
4478
4479 </sect1>
4480
4481 <sect1 id="winbuild"><title>Instructions for building under Windows</title>
4482
4483 <para>
4484 This section gives detailed instructions for how to build 
4485 GHC from source on your Windows machine. Similar instructions for
4486 installing and running GHC may be found in the user guide. In general,
4487 Win95/Win98 behave the same, and WinNT/Win2k behave the same.
4488 </para>
4489 <para>
4490 Make sure you read the preceding section on platforms (<xref linkend="platforms"/>)
4491 before reading section.
4492 You don't need Cygwin or MSYS to <emphasis>use</emphasis> GHC, 
4493 but you do need one or the other to <emphasis>build</emphasis> GHC.</para>
4494
4495
4496 <sect2 id="msys-install"><title>Installing and configuring MSYS</title>
4497
4498 <para>
4499 MSYS is a lightweight alternative to Cygwin.  
4500 You don't need MSYS to <emphasis>use</emphasis> GHC, 
4501 but you do need it or Cygwin to <emphasis>build</emphasis> GHC.
4502 Here's how to install MSYS.
4503 <itemizedlist>
4504 <listitem><para>
4505 Go to <ulink url="http://www.mingw.org/download.shtml">http://www.mingw.org/download.shtml</ulink> and 
4506 download the following (of course, the version numbers will differ):
4507 <itemizedlist>
4508   <listitem><para>The main MSYS package (binary is sufficient): <literal>MSYS-1.0.9.exe</literal>
4509   </para></listitem>
4510   <listitem><para>The MSYS developer's toolkit (binary is sufficient): <literal>msysDTK-1.0.1.exe</literal>.
4511                     This provides <command>make</command>, <command>autoconf</command>, 
4512                     <command>ssh</command>, <command>cvs</command> and probably more besides.
4513   </para></listitem>
4514 </itemizedlist>
4515 Run both executables (in the order given above) to install them.  I put them in <literal>c:/msys</literal>
4516 </para></listitem>
4517
4518 <listitem><para>
4519 Set the following environment variables
4520 <itemizedlist>
4521    <listitem><para><literal>PATH</literal>: add <literal>c:/msys/1.0/bin</literal> and 
4522                     <literal>c:/msys/1.0/local/bin</literal>
4523                     to your path.  (Of course, the version number may differ.)
4524                     MSYS mounts the former as both <literal>/bin</literal> and 
4525                     <literal>/usr/bin</literal> and the latter as <literal>/usr/local/bin</literal>.
4526   </para></listitem>
4527
4528   <listitem><para><literal>HOME</literal>: set to your home directory (e.g. <literal>c:/userid</literal>).
4529   This is where, among other things, <command>ssh</command> will look for your <literal>.ssh</literal> directory.
4530   </para></listitem>  
4531
4532   <listitem><para><literal>SHELL</literal>: set to <literal>c:/msys/1.0/bin/sh.exe</literal>
4533   </para></listitem>
4534
4535   <listitem><para><literal>CVS_RSH</literal>: set to <literal>c:/msys/1.0/bin/ssh.exe</literal>.  Only necessary if
4536               you are using CVS.
4537   </para></listitem>
4538
4539   <listitem><para><literal>MAKE_MODE</literal>: set to <literal>UNIX</literal>.  (I'm not certain this is necessary for MSYS.)
4540   </para></listitem>
4541
4542 </itemizedlist>
4543 </para></listitem>
4544
4545 <listitem><para>
4546 Check that the <literal>CYGWIN</literal> environment variable is <emphasis>not</emphasis> set.  It's a bad bug
4547 that MSYS is affected by this, but if you have CYGWIN set to "ntsec ntea", which is right for Cygwin, it
4548 causes the MSYS <command>ssh</command> to bogusly fail complaining that your <filename>.ssh/identity</filename>
4549 file has too-liberal permissinos.
4550 </para></listitem>
4551
4552 </itemizedlist>
4553 </para>
4554 <para>Here are some points to bear in mind when using MSYS:
4555 <itemizedlist>
4556 <listitem> <para> MSYS does some kind of special magic to binaries stored in 
4557 <filename>/bin</filename> and <filename>/usr/bin</filename>, which are by default both mapped
4558 to <filename>c:/msys/1.0/bin</filename> (assuming you installed MSYS in <filename>c:/msys</filename>).
4559 Do not put any other binaries (such as GHC or Alex) in this directory or its sub-directories: 
4560 they fail in mysterious ways.  However, it's fine to put other binaries in <filename>/usr/local/bin</filename>,
4561 which maps to <filename>c:/msys/1.0/local/bin</filename>.</para></listitem>
4562
4563 <listitem> <para> MSYS seems to implement symbolic links by copying, so sharing is lost.
4564 </para></listitem>
4565
4566 <listitem> <para>
4567 Win32 has a <command>find</command> command which is not the same as MSYS's find.
4568 You will probably discover that the Win32 <command>find</command> appears in your <constant>PATH</constant>
4569 before the MSYS one, because it's in the <emphasis>system</emphasis> <constant>PATH</constant> 
4570 environment variable, whereas you have probably modified the <emphasis>user</emphasis> <constant>PATH</constant> 
4571 variable.  You can always invoke <command>find</command> with an absolute path, or rename it.
4572 </para></listitem>
4573
4574 <listitem> <para>
4575 MSYS comes with <command>bzip</command>, and MSYS's <command>tar</command>'s <literal>-j</literal> 
4576 will bunzip an archive (e.g. <literal>tar xvjf foo.tar.bz2</literal>).  Useful when you get a
4577 bzip'd dump.</para></listitem>
4578
4579 </itemizedlist>
4580 </para>
4581 </sect2>
4582
4583 <sect2 id="install-cygwin"><title>Installing and configuring Cygwin</title>
4584
4585 <para> Install Cygwin from <ulink url="http://www.cygwin.com/">http://www.cygwin.com/</ulink>.
4586 The installation process is straightforward; we install it in
4587 <filename>c:/cygwin</filename>.</para>
4588 <para>
4589 You must install enough Cygwin <emphasis>packages</emphasis> to support
4590 building GHC.  If you miss out any of these, strange things will happen to you.   There are two ways to do this:
4591 <itemizedlist>
4592 <listitem><para>The direct, but laborious way is to 
4593 select all of the following packages in the installation dialogue:
4594               <command>cvs</command>, 
4595               <command>openssh</command>,
4596               <command>autoconf</command>,
4597               <command>binutils</command> (includes ld and (I think) ar),
4598               <command>gcc</command>,
4599               <command>flex</command>,
4600               <command>make</command>.
4601 To see thse packages, 
4602 click on the "View" button in the "Select Packages" 
4603 stage of Cygwin's installation dialogue, until the view says "Full".  The default view, which is
4604 "Category" isn't very helpful, and the "View" button is rather unobtrousive.
4605 </para>
4606 </listitem>
4607
4608 <listitem><para>The clever way is to point the Cygwin installer at the
4609 <command>ghc-depends</command> package, which is kept at <ulink
4610 url="http://haskell.org/ghc/cygwin">http://haskell.org/ghc/cygwin</ulink>.
4611 When the Cygwin installer asks you to "Choose a Download Site", choose one of
4612 the
4613 offered mirror sites; and then type "http://haskell.org/ghc/cygwin" into the
4614 "User URL" box and click "Add"; now two sites are selected. (The Cygwin
4615 installer remembers this for next time.)
4616 Click "Next".</para>
4617 <para>In the "Select Packages" dialogue box that follows, click the "+" sign by
4618 "Devel", scroll down to the end of the "Devel" packages, and choose
4619 <command>ghc-depends</command>.
4620 The package <command>ghc-depends</command> will not actually install anything itself, 
4621 but forces additional packages to be added by the Cygwin installer.
4622 </para>
4623 </listitem>
4624 </itemizedlist>
4625 </para>
4626
4627 <para> Now set the following user environment variables:
4628 <itemizedlist>
4629
4630 <listitem><para> Add <filename>c:/cygwin/bin</filename> and <filename>c:/cygwin/usr/bin</filename> to your 
4631 <constant>PATH</constant></para></listitem>
4632
4633 <listitem>
4634 <para>
4635 Set <constant>MAKE_MODE</constant> to <literal>UNIX</literal>. If you
4636 don't do this you get very weird messages when you type
4637 <command>make</command>, such as:
4638 <screen>/c: /c: No such file or directory</screen>
4639 </para>
4640 </listitem>
4641
4642 <listitem><para> Set <constant>SHELL</constant> to
4643 <filename>c:/cygwin/bin/bash</filename>. When you invoke a shell in Emacs, this
4644 <constant>SHELL</constant> is what you get.
4645 </para></listitem>
4646
4647 <listitem><para> Set <constant>HOME</constant> to point to your 
4648 home directory.  This is where, for example,
4649 <command>bash</command> will look for your <filename>.bashrc</filename>
4650 file.  Ditto <command>emacs</command> looking for <filename>.emacsrc</filename>
4651 </para></listitem>
4652 </itemizedlist>
4653 </para>
4654
4655 <para>Here are some things to be aware of when using Cygwin:
4656 <itemizedlist>
4657 <listitem> <para>Cygwin doesn't deal well with filenames that include
4658 spaces. "<filename>Program Files</filename>" and "<filename>Local files</filename>" are
4659 common gotchas.
4660 </para></listitem>
4661
4662 <listitem> <para> Cygwin implements a symbolic link as a text file with some
4663 magical text in it.  So other programs that don't use Cygwin's
4664 I/O libraries won't recognise such files as symlinks.  
4665 In particular, programs compiled by GHC are meant to be runnable
4666 without having Cygwin, so they don't use the Cygwin library, so
4667 they don't recognise symlinks.
4668 </para></listitem>
4669
4670 <listitem> <para>
4671 See the notes in <xref linkend="msys-install"/> about <command>find</command> and <command>bzip</command>,
4672 which apply to Cygwin too.
4673 </para></listitem>
4674
4675 <listitem>
4676 <para>
4677 Some script files used in the make system start with "<command>#!/bin/perl</command>",
4678 (and similarly for <command>sh</command>).  Notice the hardwired path!
4679 So you need to ensure that your <filename>/bin</filename> directory has at least
4680 <command>sh</command>, <command>perl</command>, and <command>cat</command> in it.
4681 All these come in Cygwin's <filename>bin</filename> directory, which you probably have
4682 installed as <filename>c:/cygwin/bin</filename>.  By default Cygwin mounts "<filename>/</filename>" as
4683 <filename>c:/cygwin</filename>, so if you just take the defaults it'll all work ok.
4684 (You can discover where your Cygwin
4685 root directory <filename>/</filename> is by typing <command>mount</command>.)
4686 Provided <filename>/bin</filename> points to the Cygwin <filename>bin</filename>
4687 directory, there's no need to copy anything.  If not, copy these binaries from the <filename>cygwin/bin</filename>
4688 directory (after fixing the <filename>sh.exe</filename> stuff mentioned in the previous bullet).
4689 </para>
4690 </listitem>
4691
4692 <listitem>
4693 <para>
4694 By default, cygwin provides the command shell <filename>ash</filename>
4695 as <filename>sh.exe</filename>.   It seems to be fine now, but in the past we
4696 saw build-system problems that turned out to be due to bugs in <filename>ash</filename>
4697 (to do with quoting and length of command lines).  On the other hand <filename>bash</filename> seems
4698 to be rock solid.
4699 If this happens to you (which it shouldn't), in <filename>cygwin/bin</filename>
4700 remove the supplied <filename>sh.exe</filename> (or rename it as <filename>ash.exe</filename>),
4701 and copy <filename>bash.exe</filename> to  <filename>sh.exe</filename>.
4702 You'll need to do this in Windows Explorer or the Windows <command>cmd</command> shell, because
4703 you can't rename a running program!
4704 </para>
4705 </listitem>
4706 </itemizedlist>
4707 </para>
4708
4709 </sect2>
4710
4711
4712 <sect2 id="configure-ssh"><title>Configuring SSH</title>
4713
4714 <para><command>ssh</command> comes with both Cygwin and MSYS. 
4715 (Cygwin note: you need to ask for package <command>openssh</command> (not ssh)
4716 in the Cygwin list of packages; or use the <command>ghc-depends</command>
4717 package -- see <xref linkend="install-cygwin"/>.)</para>
4718
4719 <para>There are several strange things about <command>ssh</command> on Windows that you need to know.
4720 <itemizedlist>
4721 <listitem>
4722 <para>
4723        The programs <command>ssh-keygen1</command>, <command>ssh1</command>, and <command>cvs</command>,
4724        seem to lock up <command>bash</command> entirely if they try to get user input (e.g. if
4725        they ask for a password).  To solve this, start up <filename>cmd.exe</filename> 
4726        and run it as follows:
4727 <screen>c:\tmp> set CYGWIN32=tty
4728 c:\tmp> c:/user/local/bin/ssh-keygen1</screen> </para>
4729 </listitem>
4730
4731 <listitem><para> (Cygwin-only problem, I think.)
4732 <command>ssh</command> needs to access your directory <filename>.ssh</filename>, in your home directory.  
4733 To determine your home directory <command>ssh</command> first looks in 
4734 <filename>c:/cygwin/etc/passwd</filename> (or wherever you have Cygwin installed).  If there's an entry
4735 there with your userid, it'll use that entry to determine your home directory, <emphasis>ignoring
4736 the setting of the environment variable $HOME</emphasis>.  If the home directory is
4737 bogus, <command>ssh</command> fails horribly.   The best way to see what is going on is to say
4738 <screen>ssh -v cvs.haskell.org</screen>
4739 which makes <command>ssh</command> print out information about its activity.
4740 </para>
4741 <para> You can fix this problem, either by correcting the home-directory field in 
4742 <filename>c:/cygwin/etc/passwd</filename>, or by simply deleting the entire entry for your userid. If
4743 you do that, <command>ssh</command> uses the $HOME environment variable instead.
4744 </para>
4745
4746 </listitem>
4747
4748 <listitem>
4749             <para>To protect your
4750             <literal>.ssh</literal> from access by anyone else,
4751             right-click your <literal>.ssh</literal> directory, and
4752             select <literal>Properties</literal>.  If you are not on
4753             the access control list, add yourself, and give yourself
4754             full permissions (the second panel).  Remove everyone else
4755             from the access control list.  Don't leave them there but
4756             deny them access, because 'they' may be a list that
4757             includes you!</para>
4758 </listitem>
4759
4760 <listitem>
4761             <para>In fact <command>ssh</command> 3.6.1 now seems to <emphasis>require</emphasis>
4762               you to have Unix permissions 600 (read/write for owner only) 
4763               on the <literal>.ssh/identity</literal> file, else it 
4764               bombs out.  For your local C drive, it seems that <literal>chmod 600 identity</literal> works,
4765               but on Windows NT/XP, it doesn't work on a network drive (exact dteails obscure).  
4766               The solution seems to be to set the $CYGWIN environment
4767               variable to "<literal>ntsec neta</literal>".  The $CYGWIN environment variable is discussed
4768               in <ulink url="http://cygwin.com/cygwin-ug-net/using-cygwinenv.html">the Cygwin User's Guide</ulink>,
4769               and there are more details in <ulink url="http://cygwin.com/faq/faq_4.html#SEC44">the Cygwin FAQ</ulink>.
4770               </para>
4771 </listitem>
4772 </itemizedlist>
4773 </para>
4774 </sect2>
4775
4776 <sect2><title>Other things you need to install</title>
4777
4778 <para>You have to install the following other things to build GHC, listed below.</para>
4779
4780 <para>On Windows you often install executables in directories with spaces, such as 
4781 "<filename>Program Files</filename>". However, the <literal>make</literal> system for fptools doesn't 
4782 deal with this situation (it'd have to do more quoting of binaries), so you are strongly advised
4783 to put binaries for all tools in places with no spaces in their path.
4784 On both MSYS and Cygwin, it's perfectly OK to install such programs in the standard Unixy places,
4785 <filename>/usr/local/bin</filename> and <filename>/usr/local/lib</filename>.  But it doesn't matter,
4786 provided they are in your path.
4787 <itemizedlist>
4788 <listitem>
4789 <para>
4790 Install an executable GHC, from <ulink url="http://www.haskell.org/ghc">http://www.haskell.org/ghc</ulink>.
4791 This is what you will use to compile GHC.  Add it in your
4792 <constant>PATH</constant>: the installer tells you the path element
4793 you need to add upon completion.
4794 </para>
4795 </listitem>
4796
4797 <listitem>
4798 <para>
4799 Install an executable Happy, from <ulink url="http://www.haskell.org/happy">http://www.haskell.org/happy</ulink>.
4800 Happy is a parser generator used to compile the Haskell grammar.  Under MSYS or Cygwin you can easily
4801 build it from the source distribution using
4802 <screen>$ ./configure
4803 $ make
4804 $ make install</screen>
4805 This should install it in <filename>/usr/local/bin</filename> (which maps to <filename>c:/msys/1.0/local/bin</filename>
4806 on MSYS).
4807 Make sure the installation directory is in your
4808 <constant>PATH</constant>.
4809 </para>
4810 </listitem>
4811
4812           <listitem>
4813             <para>Install an executable Alex.  This can be done by building from the
4814             source distribution in the same way as Happy.  Sources are
4815             available from <ulink
4816             url="http://www.haskell.org/alex">http://www.haskell.org/alex</ulink>.</para>
4817           </listitem>
4818
4819 <listitem>
4820 <para>GHC uses the <emphasis>mingw</emphasis> C compiler to
4821 generate code, so you have to install that (see <xref linkend="cygwin-and-mingw"/>). 
4822 Just pick up a mingw bundle at
4823 <ulink url="http://www.mingw.org/">http://www.mingw.org/</ulink>.
4824 We install it in <filename>c:/mingw</filename>.
4825 </para>
4826
4827 <para><emphasis>On MSYS</emphasis>, add <literal>c:/mingw/bin</literal> to your PATH. MSYS does not provide <command>gcc</command>,
4828 <command>ld</command>, <command>ar</command>, and so on, because it just uses the MinGW ones.  So you need them
4829 in your path.
4830 </para>
4831
4832 <para><emphasis>On Cygwin, do not</emphasis> add any of the <emphasis>mingw</emphasis> binaries to your  path.
4833 They are only going to get used by explicit access (via the --with-gcc flag you
4834 give to <command>configure</command> later).  If you do add them to your path
4835 you are likely to get into a mess because their names overlap with Cygwin
4836 binaries.
4837 On the other hand, you <emphasis>do</emphasis> need <command>ld</command>, <command>ar</command>
4838 (and perhaps one or two other things) in your path.  The Cygwin ones are fine,
4839 but you must have them; hence needing the  Cygwin binutils package.
4840 </para>
4841 </listitem>
4842
4843
4844 <listitem>
4845 <para>We use <command>emacs</command> a lot, so we install that too.
4846 When you are in <filename>fptools/ghc/compiler</filename>, you can use
4847 "<literal>make tags</literal>" to make a TAGS file for emacs.  That uses the utility
4848 <filename>fptools/ghc/utils/hasktags/hasktags</filename>, so you need to make that first.
4849 The most convenient way to do this is by going <literal>make boot</literal> in <filename>fptools/ghc</filename>.
4850 The <literal>make tags</literal> command also uses <command>etags</command>, which comes with <command>emacs</command>,
4851 so you will need to add <filename>emacs/bin</filename> to your <literal>PATH</literal>.
4852 </para>
4853 </listitem>
4854
4855           <listitem>
4856             <para>You might want to install GLUT in your MSYS/Cygwin
4857               installation, otherwise the GLUT package will not be built with
4858               GHC.</para>
4859           </listitem>
4860
4861 <listitem>
4862 <para> Finally, check out a copy of GHC sources from
4863 the CVS repository, following the instructions above (<xref linkend="cvs-access"/>).
4864 </para>
4865 </listitem>
4866 </itemizedlist>
4867 </para>
4868 </sect2>
4869
4870 <sect2><title>Building GHC</title>
4871
4872 <para>OK!  
4873 Now go read the documentation above on building from source (<xref linkend="sec-building-from-source"/>); 
4874 the bullets below only tell
4875 you about Windows-specific wrinkles.</para>
4876 <itemizedlist>
4877 <listitem>
4878 <para>
4879 If you used <command>autoconf</command> instead of <command>autoreconf</command>,
4880 you'll get an error when you run <filename>./configure</filename>:
4881 <screen>
4882 ...lots of stuff...
4883 creating mk/config.h
4884 mk/config.h is unchanged
4885 configuring in ghc
4886 running /bin/sh ./configure  --cache-file=.././config.cache --srcdir=.
4887 ./configure: ./configure: No such file or directory
4888 configure: error: ./configure failed for ghc</screen>
4889 </para>
4890 </listitem>
4891
4892 <listitem> <para><command>autoreconf</command> seems to create the file <filename>configure</filename>
4893 read-only.  So if you need to run autoreconf again (which I sometimes do for safety's sake),
4894 you get
4895 <screen>/usr/bin/autoconf: cannot create configure: permission denied</screen>
4896 Solution: delete <filename>configure</filename> first.
4897 </para></listitem>
4898
4899 <listitem>
4900   <para> 
4901     After <command>autoreconf</command> run <command>./configure</command> in
4902     <filename>fptools/</filename> thus:
4903
4904 <screen>$ ./configure --host=i386-unknown-mingw32 --with-gcc=c:/mingw/bin/gcc</screen>
4905 This is the point at which you specify that you are building GHC-mingw
4906 (see <xref linkend="ghc-mingw"/>). </para>
4907
4908 <para> Both these options are important! It's possible to get into
4909 trouble using the wrong C compiler!</para>
4910 <para>
4911 Furthermore, it's <emphasis>very important</emphasis> that you specify a 
4912 full MinGW path for <command>gcc</command>, not a Cygwin path, because GHC (which
4913 uses this path to invoke <command>gcc</command>) is a MinGW program and won't
4914 understand a Cygwin path.  For example, if you 
4915 say <literal>--with-gcc=/mingw/bin/gcc</literal>, it'll be interpreted as
4916 <filename>/cygdrive/c/mingw/bin/gcc</filename>, and GHC will fail the first
4917 time it tries to invoke it.   Worse, the failure comes with
4918 no error message whatsoever.  GHC simply fails silently when first invoked, 
4919 typically leaving you with this:
4920 <screen>make[4]: Leaving directory `/cygdrive/e/fptools-stage1/ghc/rts/gmp'
4921 ../../ghc/compiler/ghc-inplace -optc-mno-cygwin -optc-O 
4922   -optc-Wall -optc-W -optc-Wstrict-prototypes -optc-Wmissing-prototypes 
4923   -optc-Wmissing-declarations -optc-Winline -optc-Waggregate-return 
4924   -optc-Wbad-function-cast -optc-Wcast-align -optc-I../includes 
4925   -optc-I. -optc-Iparallel -optc-DCOMPILING_RTS 
4926   -optc-fomit-frame-pointer -O2 -static 
4927   -package-name rts -O -dcore-lint -c Adjustor.c -o Adjustor.o
4928 make[2]: *** [Adjustor.o] Error 1
4929 make[1]: *** [all] Error 1
4930 make[1]: Leaving directory `/cygdrive/e/fptools-stage1/ghc'
4931 make: *** [all] Error 1</screen>
4932 Be warned!
4933 </para>
4934
4935 <para>
4936 If you want to build GHC-cygwin (<xref linkend="ghc-cygwin"/>)
4937 you'll have to do something more like:
4938 <screen>$ ./configure --with-gcc=...the Cygwin gcc...</screen>
4939 </para>
4940 </listitem>
4941
4942 <listitem><para>
4943 If you are paranoid, delete <filename>config.cache</filename> if it exists.
4944 This file occasionally remembers out-of-date configuration information, which 
4945 can be really confusing.
4946 </para>
4947 </listitem>
4948
4949 <listitem><para> You almost certainly want to set
4950 <programlisting>SplitObjs = NO</programlisting>
4951 in your <filename>build.mk</filename> configuration file (see <xref linkend="sec-build-config"/>).
4952 This tells the build system not to split each library into a myriad of little object files, one
4953 for each function.  Doing so reduces binary sizes for statically-linked binaries, but on Windows
4954 it dramatically increases the time taken to build the libraries in the first place.
4955 </para>
4956 </listitem>
4957
4958 <listitem><para> Do not attempt to build the documentation.
4959 It needs all kinds of wierd Jade stuff that we haven't worked out for
4960 Win32.</para></listitem>
4961 </itemizedlist>
4962 </sect2>
4963
4964
4965 <sect2><title>A Windows build log using Cygwin</title>
4966
4967 <para>Here is a complete, from-scratch, log of all you need to build GHC using
4968 Cygwin, kindly provided by Claus Reinke.  It does not discuss alternative
4969 choices, but it gives a single path that works.</para>
4970 <programlisting>- Install some editor (vim, emacs, whatever)
4971
4972 - Install cygwin (http://www.cygwin.com)
4973     ; i used 1.5.16-1, installed in c:\cygwin
4974   - run 'setup.exe'
4975     Choose a Download Source:
4976         select 'download from internet';
4977     Select Root Install Directory:
4978         root dir: c:\cygwin; 
4979         install for: all users;
4980         default file type: unix
4981     Select Local Package Directory
4982         choose a spare temporary home
4983     Select Your Internet Connection
4984         Use IE5 settings
4985     Choose a Download Site
4986         Choose your preferred main mirror and
4987         Add 'http://www.haskell.org/ghc/cygwin'
4988     Select Packages
4989         In addition to 'Base' (default install), 
4990         select 'Devel->ghc-depends'
4991
4992 - Install mingw (http://www.mingw.org/)
4993     ; i used MinGW-3.1.0-1.exe
4994     ; installed in c:\mingw
4995   - you probably want to add GLUT 
4996     ; (http://www.xmission.com/~nate/glut.html)
4997     ; i used glut-3.7.3-mingw32.tar
4998
4999 - Get recent binary snapshot of ghc-6.4.1 for mingw 
5000     ; (http://www.haskell.org/ghc/dist/stable/dist/)
5001   - unpack in c:/ghc
5002   - add C:\ghc\ghc-6.4.1\bin to %PATH%
5003     (Start->Control Panel->System->Advanced->Environment Variables)
5004
5005 - Get cvs version of ghc
5006     ; also, subscribe to cvs-all@haskell.org, or follow the mailing list
5007     ; archive, in case you checkout a version with problems
5008     ; http://www.haskell.org//pipermail/cvs-all/
5009   - mkdir c:/fptools; cd c:/fptools 
5010     ; (or whereever you want your cvs tree to be)
5011   - export CVSROOT=:pserver:anoncvs@cvs.haskell.org:/cvs
5012   - cvs login
5013     ; pw: cvs
5014   - cvs checkout fpconfig
5015   - cd fptools
5016   - cvs checkout ghc hslibs libraries
5017
5018 - Build ghc, using cygwin and mingw, targetting mingw
5019   - export PATH=/cygdrive/c/ghc/ghc-6.4.1:$PATH
5020     ; for haddock, alex, happy (*)
5021   - export PATH=/cygdrive/c/mingw/bin:$PATH
5022     ; without, we pick up some cygwin tools at best!
5023   - cd c:/fptools/fptools
5024     ; (if you aren't there already)
5025   - autoreconf
5026   - ./configure --host=i386-unknown-mingw32 --with-gcc=C:/Mingw/bin/gcc.exe
5027     ; we use cygwin, but build for windows
5028   - cp mk/build.mk.sample mk/build.mk
5029   - in mk/build.mk:
5030     add line:       SplitObjs = NO
5031         (MSYS seems slow when there are zillions of object files)
5032     uncomment line: BuildFlavour = perf
5033         (or BuildFlavour = devel, if you are doing development)
5034     add line:       BIN_DIST=1
5035   - make 2>&amp;1 | tee make.log
5036     ; always useful to have a log around
5037
5038 - Package up binary distribution
5039   - make binary-dist Project=Ghc 2>&amp;1 | tee make-bin-dist.log
5040     ; always useful to have a log around
5041   - cd ghc-6.5
5042   - chmod +x ../distrib/prep-bin-dist-mingw
5043     ; if you're happy with the script's contents (*)
5044   - ../distrib/prep-bin-dist-mingw
5045     ; then tar up, unpack where wanted, and enjoy</programlisting>
5046 </sect2>
5047 </sect1>
5048
5049 <index/>
5050
5051 </article>