As a guest user you are not logged in or recognized by your IP address. You have
access to the Front Matter, Abstracts, Author Index, Subject Index and the full
text of Open Access publications.
This paper describes PALEX, a new source code representation in XML which contains parsing actions and lexical formatting information. Once a compiler finishes analyzing source code, it can record a list of parsing actions (shift, reduce and reading a token) during a compiling process. PALEX contains the recorded parsing actions and also lexical formatting information, white spaces and comments. It has two features as a source code representation in XML. The first is that it is independent of programming languages. The second is that the original source code can be restored from the PALEX code. For a prototype implementation, the Java compiler GCJ was modified. Syntax rules and lexical definitions are prepared for development of software tools which read the PALEX code and play back the parsing actions. An experiment was designed and the result shows that the size of the generated PALEX code is very large. However, the XML-based source code representation together with parsing actions offsets the cost of the large storage space.