All Databases MacTech Vol 10-1994

Executing Code

Volume Number: 10

Issue Number: 4

Column Tag: Powering UP

Executing Code

On A Power Macintosh

PEF files are more than just object code; at last you get initialization

routines and global data

By Richard Clark, Apple Computer, Inc.

Note: Source code files accompanying article are located on MacTech CD-ROM or

source code disks.

This month’s Powering Up takes a different approach than previous columns - in

the past, we told you about moving existing code to the Power Macintosh. Today, we’ll

tell you some things about using the new capabilities of the Power Macintosh runtime

to enhance new and existing applications.

The greatest advantages on the Power Macintosh come from its speed and a new

runtime architecture. This architecture was designed from the ground up for operation

on a RISC processor; in fact, most of the new runtime was derived from IBM’s runtime

model for their RISC systems. The new runtime architecture supports:

• Executable code that may be stored anywhere (i.e. in ROM, in a resource, or in

the data fork of a file)

• Executable code which may have its own static data (including global variables)

• Executable code which may export code and data, and which can import code and

data automatically from other executables, and

• Register-based calling conventions which execute quickly and efficiently

The structure of executable code

On a Power Macintosh, all native code is packaged up as “fragments.” An

application consists of one large fragment (stored in the data fork of the application

file), with supporting resources such as windows and menus in the resource fork.

Other kinds of PowerPC code, such as INITs and CDEVs, may be stored as fragments in a

resource or in the data fork of the file. Finally, the ROMs on a Power Macintosh include

fragments which implement parts of the System software.

To support such a wide range of uses, each fragment has some important common

capabilities: each has its own static data, references to imported code or data, and

pointers to functions and variables that it exports. Each fragment is stored in a “PEF

container”, which holds code in the Preferred Executable Format (PEF). Each PEF

container consists of three parts: a Data Segment, a Code Segment, and a Loader

Segment. Each of these segments is used when loading a fragment into memory:

• The Code Segment holds the executable code. The system treats code as read only,

and may load the code anywhere in memory. As a result, all branches in the code

have to be either relative to the current location, or go through pointers.

• The Data Segment holds the static data for the code, and a special table called the

“Table of Contents” or TOC. The TOC contains pointers to each global variable

used by the program, as well as pointers to code which cannot be accessed through

a PC-relative branch or which exist in other code fragments.

• The Loader Segment contains tables which list each import and export for the

current fragment, and other information as required to load a fragment.

PEF containers always retain the same format no matter where they are stored.

When a fragment is first loaded, the Code and Data segments are copied into memory

(where they become Code and Data sections.) If a fragment uses code or data from other

fragments, the Code Fragment Manager (CFM) locates and loads the referenced

fragments. The CFM then sets up the static data for each loaded fragment; this involves

filling in pointers in the Table Of Contents, expanding initialized data values, and calling

the fragment’s optional “initialization routine.” Once this is done, the CFM returns the

fragment’s “main” address and a unique “connection ID.”

If you try to load an already loaded fragment, the code is not loaded twice, but a

second copy of the data might be loaded. Fragments support three types of data sharing:

• Global data sharing uses only one data section no matter now many times the code

is loaded.

• Per-context data sharing - the default setting - loads the code once, but allocates

a different data section for each “context.” (A “context” is a name space

associated with each application, so all of the fragments used by an application

(and all of the fragments they use) fall into the same context.) This setting

allows a fragment to treat each application which calls it as a separate entity.

• Per-load data sharing allocates a data section each time the fragment is loaded,

even if multiple load requests come from the same context. You might use this

feature to implement a “network communications” fragment where each time

you load the fragment you get another connection to the network. Each connection

would get its own copy of the static data.

The Fragment Loading API

Fragments are loaded through calls to the Code Fragment Manager, as documented

in FragLoad.h. The FragLoad API consists of 7 calls for loading and unloading fragments,

as well as looking up function and data pointers in a loaded fragment:

• GetDiskFragment is commonly used for loading “plug-in additions” - it takes an

FSSpec for a file and loads the container contained in the data fork.

• GetMemFragment “prepares” a fragment that was loaded into memory without

using the Code Fragment Manager.

• GetSharedLibrary locates an existing Shared Library on the disk and loads it. This

call is normally used to get a connection ID for one of the standard shared

libraries (such as the System software “InterfaceLib”) before looking up the

address of an exported function.

• CloseConnection is used to unload a fragment. This function takes the Connection

ID returned by one of the above calls, checks a reference count which is

maintained by the CFM, and unloads the code and/or data sections for the given

fragment.

• FindSymbol looks up a symbol (code or data pointer) by name given a string and a

connection ID. If your application supports drop-in additions similar to

HyperCard XCMDs, you could use this to look up the address of an optional

function in an addition. As we will see later, this function plays a key role in

linking 68K code to a PowerPC application.

• CountSymbols counts the total number of symbols exported from a fragment. Like

FindSymbol, this call also requires a connection ID.

• GetIndSymbol can be used to index through all of the symbols exported from a

fragment. It takes a connection ID and an index number, and returns the symbol’s

name and address.

The API calls in action

We now know enough to start loading and executing code contained in a PEF

container. The following code loads a fragment from the data fork of a file and prepares

it for execution. (This code is part of “SimpleApp”, which is located on the MacTech

source code disks or from the MacTech forums on line.)

/* 1 */
FSSpec fileSpec;
ProcPtr mainAddr;
ConnectionID connID;
OSErr err;
Str255 errName;
err = GetDiskFragment ( &fileSpec, 0, 0, fileSpec.name,
kLoadNewCopy, &mainAddr, (Ptr*)&gMain,
errName);

That’s all there is to it - once this call is made, “mainAddr” contains a pointer

into the fragment (usually the entry point for the code, though the fragment could put a

pointer to some static data there if it wanted to export a table of procedure pointers,

for example.) The returned connection ID can be used to look up other exports from the

fragment, or to unload the fragment via a call to CloseConnection:

CloseConnection(&connID);

Loading a resource-based fragment is a little more difficult: it’s your

responsibility to get the fragment into memory, then you have to ask the CFM to

“prepare” the fragment for execution:

/* 2 */
FSSpec fileSpec;
ProcPtr mainAddr;
ConnectionID connID;
short refNum;
Handle codeResource;
OSErr err;
Str255 errName;
refNum = FSpOpenResFile(&fileSpec, fsCurPerm);
if (ResError() == noErr) {
  codeResource = Get1IndResource('PEF ', 1);
  if (codeResource != NULL) {
    DetachResource(codeResource );
    HLock(codeResource );
    // We have the code, but it's not ready to use yet
    // Ask the Code Fragment Manager to "prepare" the code for
execution
    err = GetMemFragment (*codeResource , 0, fileSpec.name,
kLoadNewCopy, &connID, (Ptr*)&mainAddr,
errName);
  }
  CloseResFile(refNum);

In both of these cases, if the load fails, err will return an appropriate error code

and errName will contain the name of the offending fragment.

Using global variables in fragments -

the Initialization routine

One of the more interesting aspects of fragments is how global variables are

supported. Every fragment may have its own static data, including global variables,

automatically, and this data can be initialized to set values by the Code Fragment

Manager. Thus, if you wrote:

static int x = 1234;

static ProcPtr y = &z;

x would receive the value 1234 (stored in the container’s data segment), and y

would receive the address of function “z”. (This is actually the address of a Transition

Vector, as described in “How TOC switches occur” below, and is computed at load

time.)

However, the CFM allows fragments to go beyond these simple initializations.

Each fragment may designate optional “Initialization” and optional “Termination”

routines which will be called when the fragment’s data section is being set up and torn

down. The Initialization routine receives a pointer to a block of useful information,

including a FSSpec for the fragment’s file, and can return an error value to stop the

fragment from loading. (Otherwise, the Initialization routine must return noErr.) The

termination routine takes no parameters and returns non values, and release memory,

close files, and otherwise undo whatever the initialization routine did.

The initialization and termination routines could be used to implement fully

self-initializing Macintosh managers, for instance, or C++ classes complete with

their own constructors and destructors. In the sample application, the initialization

routine is used to get the FSSpec for a drop-in addition file from within that file’s

code. This allows the addition to access its own resources.

/* 3 */
OSErr OurInitRoutine (InitBlockPtr initBlkPtr)
{
OSErr err = noErr;
short refNum = -1;
// Make sure this code is coming from the data fork of a file
if (initBlkPtr->fragLocator.where == kOnDiskFlat) {
refNum = FSpOpenResFile(
initBlkPtr->fragLocator.u.onDisk.fileSpec,
fsCurPerm);
// and so on

Next month in Powering Up

Next month’s column takes a look at the Power Macintosh calling conventions, and

at how that affects debugging.

Referenced by (3):