The lexical analyzer (Lexer) breaks code (written in sentences) into a series of known Token and pass the token stream to the ScriptStack.Compiler.Parser. Mehr ...

Öffentliche Methoden
	Lexer (List< string > lines)

List< Token >	GetTokens ()

Propertys
bool	EndOfSource `[get]`

Private Typen
enum	State { None , Divide , InlineComment , BlockComment , Assign , Plus , Minus , Multiply , Xor , Modulo , And , Or , Not , Greater , Less , Identifier , String , EscapeString , Number , Float , Hex , Bin , Oct , Char , BinaryNot }

Private Methoden
void	InvalidCharacter (char ch)

char	ReadChar ()

void	UndoChar ()

Private Attribute
List< string >	lines

int	line

int	column

State	state

Ausführliche Beschreibung

The lexical analyzer (Lexer) breaks code (written in sentences) into a series of known Token and pass the token stream to the ScriptStack.Compiler.Parser.

It is reading the source file character by character and removing any whitespace or comments. If the lexical analyzer finds a token invalid, it generates an error. More information is coming.

See https://en.wikipedia.org/wiki/Lexical_analysis

Noch zu erledigen

binary not '~'

keyword "new"

Dokumentation der Aufzählungstypen

◆ State

enum ScriptStack.Compiler.Lexer.State

private

Aufzählungswerte
None
Divide
InlineComment
BlockComment
Assign
Plus
Minus
Multiply
Xor
Modulo
And
Or
Not
Greater
Less
Identifier
String
EscapeString
Number
Float
Hex
Bin
Oct
Char
BinaryNot

        {
            None,
            Divide,
            InlineComment,
            BlockComment,
            Assign,
            Plus,
            Minus,
            Multiply,
            Xor,
            Modulo,
            And,
            Or,
            Not,
            Greater,
            Less,
            Identifier,
            String,
            EscapeString,
            Number,
            Float,
            Hex,
            Bin,
            Oct,
            Char,
            BinaryNot
        }

Beschreibung der Konstruktoren und Destruktoren

◆ Lexer()

ScriptStack.Compiler.Lexer.Lexer ( List< string > lines )

        {
 
            this.lines = new List<string>();
 
            foreach (string line in lines)
                this.lines.Add(line + "\r\n");
           
            line = 0;
 
            column = 0;
 
            state = State.None;
 
        }

Benutzt ScriptStack.Compiler.Lexer.column, ScriptStack.Compiler.Lexer.line, ScriptStack.Compiler.Lexer.lines und ScriptStack.Compiler.Lexer.state.

Dokumentation der Elementfunktionen

◆ GetTokens()

List< Token > ScriptStack.Compiler.Lexer.GetTokens ( )

Rückgabe

        {
 
            line = 0;
 
            column = 0;
 
            state = State.None;
 
            string lexeme = null;
 
            List<Token> tokenStream = new List<Token>();
 
            while (!EndOfSource)
            {
 
                string currentLine = lines[line];
 
                char ch = ReadChar();
 
                switch (state)
                {
 
                    case State.None:
                        switch (ch)
                        {
 
                            case ' ':
                            case '\t':
                            case '\r':
                            case '\n':
                                break;
                           
                            case '(':
                                tokenStream.Add(new Token(TokenType.LeftParen, "(", line, column, currentLine));
                                break;
                            case ')':
                                tokenStream.Add(new Token(TokenType.RightParen, ")", line, column, currentLine));
                                break;
                            case '[':
                                tokenStream.Add(new Token(TokenType.LeftBracket, "[", line, column, currentLine));
                                break;
                            case ']':
                                tokenStream.Add(new Token(TokenType.RightBracket, "]", line, column, currentLine));
                                break;
                            case '{':
                                tokenStream.Add(new Token(TokenType.LeftBrace, "{", line, column, currentLine));
                                break;
                            case '}':
                                tokenStream.Add(new Token(TokenType.RightBrace, "}", line, column, currentLine));
                                break;                               
                            case '.':
                                tokenStream.Add(new Token(TokenType.Period, ".", line, column, currentLine));
                                break;
                            case ':':
                                tokenStream.Add(new Token(TokenType.Colon, ":", line, column, currentLine));
                                break;
                            case ',':
                                tokenStream.Add(new Token(TokenType.Comma, ",", line, column, currentLine));
                                break;
                            case ';':
                                tokenStream.Add(new Token(TokenType.SemiColon, ";", line, column, currentLine));
                                break;
 
 
                            case '=':
                                state = State.Assign;
                                break;
                            case '+':
                                state = State.Plus;
                                break;
                            case '-':
                                state = State.Minus;
                                break;
                            case '*':
                                state = State.Multiply;
                                break;
                            case '/':
                                state = State.Divide;
                                break;
                            case '%':
                                state = State.Modulo;
                                break;
                            case '^':
                                state = State.Xor;
                                break;
                            case '&':
                                state = State.And;
                                break;
                            case '|':
                                state = State.Or;
                                break;
                            case '!':
                                state = State.Not;
                                break;
                            case '>':
                                state = State.Greater;
                                break;
                            case '<':
                                state = State.Less;
                                break;
                            case '\"':
                                lexeme = "";
                                state = State.String;
                                break;
                            case '\'':
                                lexeme = "";
                                state = State.Char;
                                break;
                            case '~':
                                state = State.BinaryNot;
                                break;
 
                            default:
                                if (char.IsLetter(ch) || ch == '_')
                                {
                                    state = State.Identifier;
                                    lexeme = "" + ch;
                                }
                                else if (char.IsDigit(ch))
                                {
                                    lexeme = "" + ch;
                                    state = State.Number;
                                }
                                else
                                    InvalidCharacter(ch);
                                break;
 
                        }
 
                        break;
 
                    case State.BinaryNot:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignBinaryNot, "~=", line, column, currentLine));
                            state = State.None;
                        }
                        break;
 
                    case State.Divide:
                        switch (ch)
                        {
                            case '/':
                                state = State.InlineComment;
                                break;
                            case '*':
                                state = State.BlockComment;
                                break;
                            case '=':
                                tokenStream.Add(new Token(TokenType.AssignDivide, "/=", line, column, currentLine));
                                state = State.None;
                                break;
                            default:
                                tokenStream.Add(new Token(TokenType.Divide, "/", line, column, currentLine));
                                UndoChar();
                                state = State.None;
                                break;
                        }
                        break;
 
                    case State.InlineComment:
                        // just read until a new line is encountered
                        if (ch == '\n')
                            state = State.None;
                        break;
 
                    case State.BlockComment:
                        if (ch == '*')
                        {
 
                            char next = ReadChar();
 
                            if (next == '/')
                            {
                                state = State.None;
                                break;
                            }
 
                        }
                        break;
 
                    case State.Assign:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.Equal, "==", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Assign, "=", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Plus:
                        if (ch == '+')
                        {
                            tokenStream.Add(new Token(TokenType.Increment, "++", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignPlus, "+=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Plus, "+", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Minus:
                        if (ch == '-')
                        {
                            tokenStream.Add(new Token(TokenType.Decrement, "--", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignMinus, "-=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Minus, "-", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Multiply:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignMultiply, "*=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Multiply, "*", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Xor:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignXor, "^=", line, column, currentLine));
                            state = State.None;
                        }
                        break;
 
                    case State.Modulo:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignModulo, "%=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Modulo, "%", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.And:
                        if (ch == '&')
                        {
                            tokenStream.Add(new Token(TokenType.And, "&&", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignBinaryAnd, "&=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                            InvalidCharacter(ch);
                        break;
 
                    case State.Or:
                        if (ch == '|')
                        {
                            tokenStream.Add(new Token(TokenType.Or, "||", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.AssignBinaryOr, "|=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                            InvalidCharacter(ch);
                        break;
 
                    case State.Not:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.NotEqual, "!=", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Not, "!", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Greater:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.GreaterEqual, ">=", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '>')
                        {
                            tokenStream.Add(new Token(TokenType.ShiftRight, ">>", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Greater, ">", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Less:
                        if (ch == '=')
                        {
                            tokenStream.Add(new Token(TokenType.LessEqual, "<=", line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '<')
                        {
                            tokenStream.Add(new Token(TokenType.ShiftLeft, "<<", line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            tokenStream.Add(new Token(TokenType.Less, "<", line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Identifier:
 
                        if (char.IsLetterOrDigit(ch) || ch == '_')
                            lexeme += ch;
 
                        else
                        {
 
                            TokenType tokenType;
 
                            if (lexeme == "null")
                                tokenType = TokenType.Null;
                            else if (lexeme == "true" || lexeme == "false")
                                tokenType = TokenType.Boolean;
                            else if (lexeme == "if")
                                tokenType = TokenType.If;
                            else if (lexeme == "else")
                                tokenType = TokenType.Else;
                            else if (lexeme == "while")
                                tokenType = TokenType.While;
                            else if (lexeme == "for")
                                tokenType = TokenType.For;
                            else if (lexeme == "foreach")
                                tokenType = TokenType.Foreach;
                            else if (lexeme == "in")
                                tokenType = TokenType.In;
                            else if (lexeme == "switch")
                                tokenType = TokenType.Switch;
                            else if (lexeme == "case")
                                tokenType = TokenType.Case;
                            else if (lexeme == "default")
                                tokenType = TokenType.Default;
                            else if (lexeme == "break")
                                tokenType = TokenType.Break;
                            else if (lexeme == "continue")
                                tokenType = TokenType.Continue;
                            else if (lexeme == "function")
                                tokenType = TokenType.Function;
                            else if (lexeme == "return")
                                tokenType = TokenType.Return;
 
                            else if (lexeme == "shared")
                                tokenType = TokenType.Shared;
                            else if (lexeme == "var")
                                tokenType = TokenType.Var;
                            else if (lexeme == "volatile")
                                tokenType = TokenType.Volatile;
                            else if (lexeme == "struct")
                                tokenType = TokenType.Struct;
                            else if (lexeme == "enum")
                                tokenType = TokenType.Enum;
 
                            else if (lexeme == "include")
                                tokenType = TokenType.Include;
                            else if (lexeme == "lock")
                                tokenType = TokenType.Lock;
                            else if (lexeme == "run")
                                tokenType = TokenType.Run;
                            else if (lexeme == "yield")
                                tokenType = TokenType.Yield;
                            else if (lexeme == "notify")
                                tokenType = TokenType.Notify;
                            else if (lexeme == "wait")
                                tokenType = TokenType.Wait;
                            else
                                tokenType = TokenType.Identifier;
 
                            if (tokenType == TokenType.Boolean)
                            {
 
                                bool val = false;
 
                                if (lexeme == "true")
                                    val = true;
 
                                tokenStream.Add(new Token(tokenType, val, line, column, currentLine));
 
                            }
 
                            else
                                tokenStream.Add(new Token(tokenType, lexeme, line, column, currentLine));
 
                            UndoChar();
 
                            state = State.None;
 
                        }
                        break;
 
                    case State.Char:
                        /* \Todo */
                        while (ch != '\'')
                        {
                            lexeme += ch;
                            ch = ReadChar();
                        }
                        if (ch == '\'')
                        {
                            char c;
                            if (lexeme == "\\n") c = '\n';
                            else if (lexeme == "\\t") c = '\t';
                            else if (lexeme == "\\b") c = '\b';
                            else if (lexeme == "\\r") c = '\r';
                            else if (lexeme == "\\f") c = '\f';
                            else if (lexeme == "\\\'") c = '\'';
                            else if (lexeme == "\\\"") c = '\"';
                            else if (lexeme == "\\\\") c = '\\';
                            else c = char.Parse(lexeme);
                            tokenStream.Add(new Token(TokenType.Char, c, line, column, currentLine));
                            state = State.None;
                        }
                        else
                            throw new LexerException("Ein 'Character' darf genau ein Zeichen lang sein - ausgenommen Steuerzeichen!", line, column, lines[line]);
                        break;
 
                    case State.String:
                        if (ch == '\"') // string is ready!
                        {
                            tokenStream.Add(new Token(TokenType.String, lexeme, line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == '\\') // escape character, start string escape
                        {
                            state = State.EscapeString;
                        }
                        else if (ch == '\r' || ch == '\n') // if there is actually a line break inside the string..
                        {
                            throw new LexerException("Ein String darf sich nicht auf mehrere Zeilen erstrecken.", line, column, lines[line]);
                        }
                        else // just add the character
                        {
                            lexeme += ch;
                        }
                        break;
 
                    case State.EscapeString:
                        /*
                         * Always return to TokenState.String because we are inside a string!
                         */
                        if (ch == '"')
                        {
                            lexeme += '\"';
                            state = State.String;
                        }
                        else if (ch == '\\')
                        {
                            lexeme += ch;
                            state = State.String;
                        }
                        else if (ch == 'n')
                        {
                            lexeme += '\n';
                            state = State.String;
                        }
                        else if (ch == 't')
                        {
                            lexeme += '\t';
                            state = State.String;
                        }
                        else if (ch == 'r')
                        {
                            lexeme += '\r';
                            state = State.String;
                        }
                        else if (ch == 'n')
                        {
                            lexeme += '\n';
                            state = State.String;
                        }
                        else if (ch == 'b')
                        {
                            lexeme += '\b';
                            state = State.String;
                        }
                        else
                            throw new LexerException("Das Escapezeichen '\\" + ch + "' kann in Strings nicht verarbeitet werden.", line, column, lines[line]);
                        
                        break;
 
                    case State.Number:
                        /*
                         * In the phase of lexing numbers are also strings
                         * They are casted later on
                         */
                        if (char.IsDigit(ch))
                            lexeme += ch;
                        else if (ch == '.') // culture?!?
                        {
                            lexeme += '.';
                            state = State.Float;
                        }
                        else if (ch == 'x')
                        {
                            lexeme += ch;
                            state = State.Hex;
                        }
                        else if (ch == 'b')
                        {
                            int intValue = Convert.ToInt32(lexeme, 2);
                            // \todo in fact this is a 32 bit integer
                            tokenStream.Add(new Token(TokenType.Integer, intValue, line, column, currentLine));
                            state = State.None;
                        }
                        else if (ch == 'o')
                        {
                            int intValue = Convert.ToInt32(lexeme, 8);
                            tokenStream.Add(new Token(TokenType.Integer, intValue, line, column, currentLine));
                            state = State.None;
                        }
                        else
                        {
                            int intValue = int.Parse(lexeme);
                            tokenStream.Add(new Token(TokenType.Integer, intValue, line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Float:
                        if (char.IsDigit(ch))
                            lexeme += ch;
                        else
                        {
                            float floatValue = float.Parse(lexeme, System.Globalization.CultureInfo.InvariantCulture);
                            tokenStream.Add(new Token(TokenType.Float, floatValue, line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    case State.Hex:
                        if (char.IsDigit(ch) || char.IsLetter(ch))
                        {
                            if (char.IsLetter(ch) && !(ch >= 'a' && ch <= 'f') || (ch >= 'A' && ch <= 'F'))
                                throw new LexerException("Ein hexadezimaler Wert darf ausser Zahlen nur Buchstaben von 'a' - 'f' bzw. 'A' - 'F' enthalten.", line, column, currentLine);
                            lexeme += ch;
                        }
                        else
                        {
                            int intValue = Convert.ToInt32(lexeme, 16);
                            tokenStream.Add(new Token(TokenType.Integer, intValue, line, column, currentLine));
                            UndoChar();
                            state = State.None;
                        }
                        break;
 
                    default:
                        throw new LexerException("Unbekannter Lexer Status '" + state + "'.");
 
                }
 
            }
 
            if (state != State.None)
                throw new LexerException("Unerwartetes Ende des TokenStream.");         
 
            return tokenStream;
 
        }

Benutzt ScriptStack.Compiler.Lexer.column, ScriptStack.Compiler.Lexer.EndOfSource, ScriptStack.Compiler.Lexer.InvalidCharacter(), ScriptStack.Compiler.Lexer.line, ScriptStack.Compiler.Lexer.lines, ScriptStack.Compiler.Lexer.ReadChar(), ScriptStack.Compiler.Lexer.state und ScriptStack.Compiler.Lexer.UndoChar().

◆ InvalidCharacter()

void ScriptStack.Compiler.Lexer.InvalidCharacter ( char ch )

private

        {
            throw new LexerException("Unerwartetes Zeichen '" + ch + "'.\n", line, column, lines[line]);
        }

Benutzt ScriptStack.Compiler.Lexer.column, ScriptStack.Compiler.Lexer.line und ScriptStack.Compiler.Lexer.lines.

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens().

◆ ReadChar()

char ScriptStack.Compiler.Lexer.ReadChar ( )

private

        {
 
            if (EndOfSource)
                throw new LexerException("Das Ende des TokenStream wurde erreicht.");
 
            char ch = lines[line][column++];
 
            if (column >= lines[line].Length)
            {
 
                column = 0;
 
                ++line;
 
            }
 
            return ch;
        }

Benutzt ScriptStack.Compiler.Lexer.column, ScriptStack.Compiler.Lexer.EndOfSource, ScriptStack.Compiler.Lexer.line und ScriptStack.Compiler.Lexer.lines.

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens().

◆ UndoChar()

void ScriptStack.Compiler.Lexer.UndoChar ( )

private

        {
 
            if (line == 0 && column == 0)
                throw new LexerException("Der Anfang des TokenStream wurde erreicht.");
 
            --column;
 
            if (column < 0)
            {
 
                --line;
 
                column = lines[line].Length - 1;
 
            }
 
        }

Benutzt ScriptStack.Compiler.Lexer.column, ScriptStack.Compiler.Lexer.line und ScriptStack.Compiler.Lexer.lines.

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens().

Dokumentation der Datenelemente

◆ column

int ScriptStack.Compiler.Lexer.column

private

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens(), ScriptStack.Compiler.Lexer.InvalidCharacter(), ScriptStack.Compiler.Lexer.Lexer(), ScriptStack.Compiler.Lexer.ReadChar() und ScriptStack.Compiler.Lexer.UndoChar().

◆ line

int ScriptStack.Compiler.Lexer.line

private

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens(), ScriptStack.Compiler.Lexer.InvalidCharacter(), ScriptStack.Compiler.Lexer.Lexer(), ScriptStack.Compiler.Lexer.ReadChar() und ScriptStack.Compiler.Lexer.UndoChar().

◆ lines

List<string> ScriptStack.Compiler.Lexer.lines

private

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens(), ScriptStack.Compiler.Lexer.InvalidCharacter(), ScriptStack.Compiler.Lexer.Lexer(), ScriptStack.Compiler.Lexer.ReadChar() und ScriptStack.Compiler.Lexer.UndoChar().

◆ state

State ScriptStack.Compiler.Lexer.state

private

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens() und ScriptStack.Compiler.Lexer.Lexer().

Dokumentation der Propertys

◆ EndOfSource

bool ScriptStack.Compiler.Lexer.EndOfSource

getprivate

        {
            get { return line >= lines.Count; }
        }

Wird benutzt von ScriptStack.Compiler.Lexer.GetTokens() und ScriptStack.Compiler.Lexer.ReadChar().

Die Dokumentation für diese Klasse wurde erzeugt aufgrund der Datei:

C:/Users/manuel/Desktop/doc/Compiler/Lexer.cs

Öffentliche Methoden

Propertys

Private Typen

Private Methoden

Private Attribute

Ausführliche Beschreibung

Dokumentation der Aufzählungstypen

◆ State

Beschreibung der Konstruktoren und Destruktoren

◆ Lexer()

Dokumentation der Elementfunktionen

◆ GetTokens()

◆ InvalidCharacter()

◆ ReadChar()

◆ UndoChar()

Dokumentation der Datenelemente

◆ column

◆ line

◆ lines

◆ state

Dokumentation der Propertys

◆ EndOfSource