All Databases MacTech Vol 08-1992

Self Modifying Code

Volume Number: 8

Issue Number: 4

Column Tag: Article Rebuttal

Self Modifying code is a No-No!

A better way to do an event patch without self-modifying code or

Assembly

By Scott T. Boyd, Apple Computer, Inc. and Mike Scanlin, MacTutor

Regular Contributing Author

Note: Source code files accompanying article are located on MacTech CD-ROM or

source code disks.

Mike Scanlin’s article “Rotten Apple INIT for April Fool’s” brings up a minor

but essential point. The GetNextEvent patch looks like:

;1
@first
 Lea @exitAddress, A0
 Move.L (SP)+,(A0)
 Lea @ eventRecPtr,A0
 Move.L (SP),(A0)
 Pea @tailPatch
 DC JmpInstruction
@origTrap
 NOP
 NOP

The NOPs get replaced with the original GetNextEvent trap address by the

installation code:

;2
 Lea @origTrap,A1
 Move.L A0,(A1)

Now, consider the processor instruction cache. It’s a piece of the processor

which remembers what’s at a set of memory locations. It does this so the CPU won’t

have to do a memory access for recently referenced instructions. This is designed to

save time. It’s a neat hardware feature.

However, the Macintosh system software doesn’t make a distinction between code

and data. That’s different from OSs like Unix, which keep code and data in separate

address spaces. When Mike’s code installs the original trap address with the Move.L

A0,(A1), it’s putting an address into the middle of a piece of code. Unfortunately, the

cache which records the new value is the data cache.

The instruction cache has no clue that instructions just changed. This is one way

of doing what’s commonly called “self-modifying code”.

Self-modifying code is, in general, a bad thing to write. Apple has long

discouraged, and continues to discourage self-modifying code.

This is bad in this case because the processor, if it were to execute this code

right away, might believe (for some unspecified reason) that those memory locations

were cached in the instruction cache. If it did, it would pull whatever had been in that

location the last time code was executed from that spot, then try to execute whatever

was there. The odds that the instruction cache held the value you just put into the data

cache are not in your favor.

Contrast that approach with this approach:

;3
@first
 ...
 Move.L @origTrap,A0
 JMP.L (A0)
 ...
/* branch around this, or put it somewhere else, but don't let the PC run
through here */
@origTrap
 NOP
 NOP

While this approach still stores the old address into a piece of code, it’s never

referenced as code by the processor. It’s treated specifically as data. The instruction

cache never comes into play since the original address is moved as data.

Yet another approach, which also saves a register:

;4
@first
...
Move.L @origTrap,-(SP)
RTS
...

TN #261: “Cache As Cache Can” discusses this topic in more detail, e specially

with regard to moving whole chunks of code around.

As it happens, the caches are almost certainly flushed before this particular

eight bytes ever get loaded for execution, but that’s a happy coincidence, and not

something you should rely on. What’s happening is that we have made several traps

flush the caches (guaranteeing that there won’t be any misunderstanding about

something being in the instruction cache when it’s not), but we may change our minds

about which traps should flush, and when. You shouldn’t count on any given trap’s

current cache-flushing behavior.

One final consideration. Putting data into code does not work if code is ever

write-protected, and that may happen one day. So where can you put something when

you can’t allocate any global storage (e.g., PC-relative data or low-memory globals

with a fixed address)? You can use NewGestalt to register a new selector. When you

call Gestalt, it can return a value which is actually a pointer (or handle) to your global

data. This technique won’t work well if you can’t afford to make the trap call (like

from some time-sensitive routine you’ve patched), but it works nicely if you have the

time and you want to avoid putting data into your patch code.

Scott T Boyd, Apple Computer, Inc.

Mike Scanlin Says

Scott's point about stale code in the instruction cache is well taken and I deserve

a thumping for having written it. I made the poor judgement call that it wouldn't matter

in this case because I expected the instruction cache to be flushed between the time the

patch installation code finished and the first time the patch code was executed. I hang my

head in shame.

As partial retribution (and to satisfy a few requests for a non-assembly

version) I have written a trap patching shell in C that doesn't use any self-modifying

code (see listing below). It obeys all of the rules except for the one about storing data

into a code segment (Scott's solutions have this problem, too, as he mentions). Until we

have write-protected code segments, this will not be a problem.

Mike Scanlin

/*********************************************************
 * PatchGNE.c:
 * This INIT installs a patch on GetNextEvent and
 * SystemEvent that intercepts keyDown and autoKey  events.
 * For this example, the intercepted key  events are
 * converted to lower case if both the capsLock key and the
 * shiftKey are down (thus making the Mac keyboard behave
 * like an IBM keyboard). However, you can use this shell to
 * do generalized event intercepting as well as generalized
 * trap patching (with no asm and no self-modifying code).
 * If your patches need globals, put them in the
 * PatchGlobals struct and initialize them in main.
 * In Think C, set the Project Type to Code Resource, the
 * File Type to INIT, the Creator to anything, the Type to
 * INIT, the ID to something like 55 (55 will work but it
 * doesn't have to be 55), turn Custom Header ON and Attrs
 * to 20 (purgeable) and Multi Segment OFF.
 *
 * Mike Scanlin. 16 May 1992.
 *********************************************************/
#include "Traps.h
/**********************************************************
 * typedefs
 *********************************************************/
typedef pascal short (*GNEProcPtr)(short  eventMask,
 EventRecord *theEvent);
typedef pascal short (*SEProcPtr)(EventRecord *theEvent);
typedef struct PatchGlobals {
 GNEProcPtr pgOldGNE;
 SEProcPtr pgOldSE;
} PatchGlobals, *PatchGlobalsPtr;
/**********************************************************
 * prototypes
 *********************************************************/
void main(void);
void StartPatchCode(void);
pascal short MyGetNextEvent(short  eventMask, EventRecord
 *theEvent);
pascal short MySystemEvent(EventRecord *theEvent);
void CheckKeyCase(EventRecord *theEvent);
void EndPatchCode(void);
/**********************************************************
 * main:
 * Gets some memory in the system heap and installs the GNE
 * and SE patches (as well as allocating and initializing
 * the patc 8.4 Self Modifying Codeutine that gets
 * executed at startup time (by the INIT mechanism).
 *
 * The block of memory that main allocates will look like
 * this when main has finished:
 *
 *                   +--------------------+
 *                   |    PatchGlobals    |
 *                   +--------------------+
 *                   |  StartPatchCode()  |
 *  GNE trap addr -> +--------------------+
 *                   |  MyGetNextEvent()  |
 *   SE trap addr -> +--------------------+
 *                   |  MySystemEvent()   |
 *                   +--------------------+
 *                   |   CheckKeyCase()   |
 *                   +--------------------+
 *                   |   EndPatchCode()   |
 *                   +--------------------+
 *
 *********************************************************/
void main()
{
    Ptr             patchPtr;
    PatchGlobalsPtr pgPtr;
    long            codeSize, offset;
    /* try and get some memory in the system heap for code
       and globals */
    codeSize = (long) EndPatchCode - (long) StartPatchCode;
    patchPtr = NewPtrSys(codeSize + sizeof(PatchGlobals));
    if (!patchPtr)
        return; /* out of memory -- abort patching */
    /* initialize the patch globals at the beginning
       of the block */
    pgPtr = (PatchGlobalsPtr) patchPtr;
    pgPtr->pgOldGNE = (GNEProcPtr)
      GetTrapAddress(_GetNextEvent);
    pgPtr->pgOldSE = (SEProcPtr)
      GetTrapAddress(_SystemEvent);
    /* move the code into place after the globals */
    BlockMove(StartPatchCode, patchPtr +
      sizeof(PatchGlobals), codeSize);
    /* set the patches */
    patchPtr += sizeof(PatchGlobals);
    offset = (long) MyGetNextEvent - (long) StartPatchCode;
    SetTrapAddress((long) patchPtr + offset, _GetNextEvent);
    offset = (long) MySystemEvent - (long) StartPatchCode;
    SetTrapAddress((long) patchPtr + offset, _SystemEvent);
}
/**********************************************************
 * StartPatchCode:
 * Dummy proc to mark the beginning of the code for the
 * patches.  Make sure all of your patch code is between
 * here and EndPatchCode.
*********************************************************/
void StartPatchCode()
{
}
/*********************************************************
 * MyGetNextEvent:
 * Tail patch on GetNextEvent.
 *
 * The reason this returns a short instead of a Boolean is
 * because we need to make sure the low byte of the top word
 * on the stack is zero because some programs do a Tst.W
 * (SP)+ when this returns instead of Tst.B (SP)+ like they
 * should (which is technically their bug but, we might as
 * well work around it since it's not hard).
 *
 * If you want to eat the event and not pass it on to the
 * caller then set returnValue to zero.
 *********************************************************/
pascal short MyGetNextEvent(short  eventMask,
  EventRecord *theEvent)
{
    PatchGlobalsPtr pgPtr;
    short           returnValue;
    /* find our globals */
    pgPtr = (PatchGlobalsPtr) ((long) StartPatchCode -
      sizeof(PatchGlobals));
    /* call original GNE first */
    returnValue = (*pgPtr->pgOldGNE)( eventMask, theEvent);
    /* do some post-processing */
    CheckKeyCase(theEvent);
    /* return to original caller */
    return (returnValue);
}
/**********************************************************
 * MySystemEvent:
 * Tail patch on SystemEvent.
 *
 * The reason this returns a short instead of a Boolean is
 * because we need to make sure the low byte of the top word
 * on the stack is zero because some programs do a Tst.W
 * (SP)+ when this returns instead of Tst.B (SP)+ like they
 * should (which is technically their bug but, we might as
 * well work around it since it's not hard).
 *
 * We need this patch as well as the one on GetNextEvent
 * because of desk accessories. If you don't patch
 * SystemEvent then the patch will not apply to  events that
 * are sent to DAs.
 *
 * If you want to eat the event and not pass it on to the
 * caller then set returnValue to zero.
 *********************************************************/
pascal short MySystemEvent(EventRecord *theEvent)
{
    PatchGlobalsPtr pgPtr;
    short           returnValue;
    /* find our globals */
    pgPtr = (PatchGlobalsPtr) ((long) StartPatchCode -
      sizeof(PatchGlobals));
    /* call original GNE first */
    returnValue = (*pgPtr->pgOldSE)(theEvent);
    /* do some post-processing */
    CheckKeyCase(theEvent);
    /* return to original caller */
    return (returnValue);
}
/*********************************************************
 * CheckKeyCase:
 * If theEvent was a keyDown or autoKey  event, this checks
 * if both the shiftKey and the capsLock key were down. If
 * so, it changes theEvent to be a lowercase letter. If not,
 * nothing is changed.  Also, if either the optionKey or
 * cmdKey is down then nothing is changed.
 ********************************************************/
void CheckKeyCase(EventRecord *theEvent)
{
    register long   theMods, theMessage;
    register char   theChar;
    if (theEvent->what == keyDown ||
      theEvent->what == autoKey) {
        theMods = theEvent->modifiers;
        theMods &= shiftKey | alphaLock |
          optionKey | cmdKey;
        theMods ^= shiftKey | alphaLock;
        if (!theMods) {
            theMessage = theEvent->message;
            theChar = theMessage & charCodeMask;
            if (theChar >= 'A' && theChar <= 'Z') {
                theMessage &= ~charCodeMask;
                theMessage |= theChar + 'a' - 'A';
                theEvent->message = theMessage;
            }
        }
    }
}
/*********************************************************
 * EndPatchCode:
 * Dummy proc to mark the end of the code for the patches.
 * Make sure all of your patch code is between here and
 * StartPatchCode.
 *********************************************************/
void EndPatchCode()
{
}

Referenced by (5):