forked from ashumkin/RegExpr
-
Notifications
You must be signed in to change notification settings - Fork 0
/
History.txt
322 lines (298 loc) · 15.4 KB
/
History.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
---------------------------------------------------------------
History
---------------------------------------------------------------
Legend:
(+) - new feature
(-) - bugfix
(^) - improvement
v. 0.952 2004.01.11
-=- (+) FreePascal support, thanks to Yaroslav Romanchenko (SAGE)
v. 0.951 2002.09.29
-=- (+) Defined constant EscChar ('\' by default). Can be usefull for
C++ Builder users who are tired with r.e. like '\\w+\\\\\\w+\\.\\w+'.
Now they can define for example EscChar='/' and write '/w+\/w+/./w+' -
sligtly unusual but more readable.. Suggested by unonimous CPPB user.
-=- (+) Overloaded versions of Exec (only for Delphi 5 and higher).
-=- (-) ExecNext in some cases didn't raise exception if called after
unsuccessful exec*, that is wrong (ExecNext must be called ONLY after
successful Exec* ). Bug reported by Craig A. Peterson
-=- (+) ReplaceEx and overloaded version of Replace - for
complex, context-sensitive replacements. Implemented by Thor Asmund.
-=- (+) Use StrLCopy instead of StpPCopy - in SetExpression() as 0.927
did in SetInputString(). Fixed by Maley, Scott D.
-=- (-) Bug in HyperLinkDecorator.DecorateURLs.
-=- (-) Delphi 2 compatibility restored.
-=- (^) Delphi 7 compatibility tested, Ok.
-=- (+) New demos.
-=- (^) Documentation and distribution package were reorganized.
v. 0.948 2002.01.03
-=- (-) back-offset calculation in UniCode version due to
Delphi's PWideChar subtraction 'feature' (produced
Integer overflow (if Overflow checking is on) and,
sometimes, wrong operation), bug reported by Craig A. Peterson
-=- (-) Bug that was found by Lars Karlslund was fixed by Martin Fuller
(in addition Martin made some excellent source code optimizations).
The bug was: "If I do '(something|^$)' on '' I get false (which is
wrong ...).". Fixed ExecPrim (empty strings were not allowed to pass
thru in UseFirstCharSet mode) and FillFirstCharSet (many bugs).
-=- (-) Visualization bug in Dump method. Fixed by Martin Fuller.
v. 0.947 2001.10.03
-=- (+) Word boundary (\b & \B) metachar
-=- (-) Bug in processing predefined char.classes in non-UseSetOfChar mode
-=- (+) Spanish help - translated by Diego Calp ([email protected]), Argentina
-=- (+) VersionMajor/Minor class method of TRegExpr ;)
-=- (-) Bug in CompileRegExpr, Thanks to Oleg Orlov <[email protected]>
-=- (^) Method RegExprSubExpressions wasn't compatible with D2-D3.
Thanks to Eugene Tarasov for bug report.
-=- (+) Method Replace can now do substitution as well (see documentation)
Thanks to Warren Bare, Ken Friesen and many others who suggested it.
-=- (+) Updated ReplaceRegExpr to use new Replace method functionality
-=- (^) Restored UniCode compatibility lost in some previous version
Thanks to Stephan Klimek for bug report
-=- (^) Updated TestRE project, new examples for Replace with substitution
included.
v. 0.942+ 2001.03.01
-=- (+) Published French help for TRegExpr,
translated by Martin Ledoux
v. 0.942 2001.02.12
-=- (-) Range-check error in DEMO-project (due to bug in
RegExprSubExpressions), Thanks to Juergen Schroth
-=- (^) RegExprSubExpressions - added error codes for "unclosed "[" error
-=- (^) Help file bug fixing
v. 0.941 2001.02.01
-=- (^) Attension! Behaviour of '\w', '\W' was changed! Now it really
match alphanum characters and '_' as described in documentation,
not only alpha as it was before. Thanks to Vadim Alexandrov.
If You want to restore previous behaviour, reassign
RegExprWordChars (exclude '0123456789' from it).
-=- (+) Full compatible with recommended at unicode.org implementation
of modifier /m, including DOS-styled line separators (\r\n) mixed
with Unix styled (\n) - see properties LineSeparators, LinePairedSeparator
-=- (^) Attension! Behaviour of '.' was changed! Now if modifier /s is off
it doesn't match all chars from LineSeparators and LinePairedSeparator (by
default \r and \n)
-=- (^) Attension! To prevent unneeded recompilation of r.e., now assignment
to Expression or changing modifiers doesn't cause immidiate [re]compilation.
So, now You don't get exception while assigning wrong expression, but can
get exception while calling Exec[Next], Substitute, Dump, etc if there
are errors in Expression or other properties.
-=- (+) Non-greedy style iterators (like '*?'), modifier /g.
Implemented with help from Matthew Winter and Filip Jirsák
-=- (+) /x modifier (eXtended syntax - allow formating r.e., see description
in the help)
-=- (+) Procedure Compile to [re]compile r.e. Usefull for GUI r.e. editors
and so on (to check all properties validity).
-=- (+) FAQ in documentation. I am too lazy to answer to the same
questions again and again :( Please, read the FAQ before sending
question to me!
-=- (^) DEMO project have been significantly improved. Now this is the
real r.e. debugger! Thanks to Jon Smith for his ideas.
-=- (+) function RegExprSubExpressions, usefull for GUI editors of
r.e. (see example of using in TestRExp.dpr project)
-=- (+) HyperLinkDecorator unit - practical example of TRegExpr
using (see description in the help file)
-=- (-) Range checking error in some cases if ComplexBraces defined
Thanks to Juergen Schroth
-=- (^) 'ComplexBraces' now is defined by default
-=- (+) Kit Eason sent to me many examples for 'Syntax' help section
and I decided to complitely rewrite this section. I hope, You'll enjoy
the results ;)
-=- (+) The \A and \Z metacharacters are just like "^" and "$", except
that they won't match multiple times when the modifier /m is used
v. 0.939 2000.10.04
-=- (-) Bug in Substitute method ($10.. didn't work properly)
Thanks to Serge S Klochkovski
v. 0.938 2000.07.23
-=- (^) Exeptions now jump to appropriate source line, not
to Error procedure (I am not quite sure this is safe for
all compiler versions. You can turn it off - remove
reRealExceptionAddr definition below).
-=- (^) Forgotten BSUBEXP[CI] in FillFirstCharSet caused
exeption 'memory corruption' in case if back reference can
be first op, like this: (a)*\1 (first subexpression can be
skipped and we'll start matching with back reference..).
v. 0.937 2000.06.12
-=- (-) Bug in optimization engine (since v.0.934). In some cases
TRegExpr didn't catch right strings.
Thanks to Matthias Fichtner
v. 0.936 2000.04.22
-=- (+) Back references, like <font size=(['"]?)(\d+)\1>, see
manual for details
-=- (+) Wide hex char support, like '\x{263a}'
v. 0.935 2000.04.19 (by Yury Finkel)
-=- (-) fInvertCase now isn't readonly ;)
-=- (-) UniCode mode compiling errors
v. 0.934 2000.04.17
-=- (^) New ranges implementation (range matching now is very fast
- uses one(!) CPU instruction)
-=- (^) Internal p-code structure converted into 32-bits - works
faster and now there is no 64K limit for compiled r.e.
-=- (^) '{m,n}' now use 32-bits arguments (up to 2147483646) - specially
for Dmitry Veprintsev ;)
-=- (^) Ranges now support metachars: [\n-\x0D] -> #10,#11,#12,#13;
Changed '-' processing, now it's like in Perl:
[\d-t] -> '0'..'9','-','t'; []-a] -> ']'..'a'
-=- (-) Bug with \t and etc macro (they worked only in ranges)
Thanks to Yury Finkel
-=- (^) Added new preprocessing optimization (see FirstCharSet).
Incredible fast (!). But be carefull it isn's properly tested.
You can switch it Off - remove UseFirstCharSet definition.
-=- (^) Many other speed optimizations
-=- (-) Case-insensitive mode now support system-defined national
charset (due to bug in v.0.90 .. 0.926 supported only english one)
-=- (^) Case-insensitive mode implemented with InvertCase (param &
result of REChar type) - works 10 .. 100 times faster.
-=- (^) Match and ExecNext interfaces optimized, added IsProgrammOk
by Ralf Junker
-=- (^) Increased NSUBEXP (now 15) and fixed code for this, now you
can simply increase NSUBEXP constant by yourself.
Suggested by Alexander V. Akimov.
-=- (^+) Substitute adapted for NSUBEXP > 10 and significant (!)
optimized, improved error checking.
ATTENTION! Read new Substitute description - syntax was changed !
-=- (+) SpaceChars & WordChars property - now you may change chars
treated as \s & \w. By defauled assigned RegExprSpaceChars/WordChars
-=- (+) Now \s and \w supported in ranges
-=- (-) Infinite loop if end of range=#$FF
Thanks to Andrey Kolegov
-=- (+) Function QuoteRegExprMetaChars (see description)
-=- (+) UniCode support - sorry, works VERY slow (remove '.' from
{.$DEFINE UniCode} after this comment for unicode version).
Implemented by Yury Finkel
v. 0.926 2000.02.26
-=- (-) Old bug derived from H.Spencer sources - SPSTART was
set for '?' and '*' instead of '*', '{m,n}' and '+'.
-=- (-^) Now {m,n} works like Perl's one - error occures only
if m > n or n > BracesMax (BracesMax = 255 in this version).
In other cases (no m or nondigit symbols in m or n values,
or no '}') symbol '{' will be compiled as literal.
Note: so, you must include m value (use {0,n} instead of {,n}).
Note: {m,} will be compiled as {m,BracesMax}.
-=- (-^) CaseInsensitive mode now support ranges
'(?i)[a]' == '[aA]'
-=- (^) Roman-number template in TestRExp ;)
-=- (+^) Beta version of complex-braces - like ((abc){1,2}|d){3}
By default its turned off. If you want take part in beta-testing,
please, remove '.' from {.$DEFINE ComplexBraces} below this comments.
-=- (-^) Removed \b metachar (in Perl it isn't BS as in my implementation,
but word bound)
-=- (+) Add /s modifier. Bu I am not sure that it's ok for Windows.
I implemented it as [^\n] for '.' metachar in non-/s mode.
But lines separated by \n\r in windows. I need you suggestions !
-=- (^) Sorry, but I had to rename Modifiers to ModifierStr
(ModifierS uses for /s now)
v. 0.91 2000.02.02
-=- (^) some changes in documentation and demo-project.
v. 0.90 2000.01.31
-=- (+) implemented braces repetitions {min,max}.
Sorry - only simple cases now - like '\d{2,3}'
or '[a-z1-9]{,7}', but not (abc){2,3} ..
I still too short in time.
Wait for future versions of TRegExpr or
implement it by youself and share with me ;)
-=- (+) implemented case-insensitive modifier and way
to work with other modifiers - see properties
Modifiers, Modifier, ModifierI
and (?ismx-ismx) Perl extension.
You may use global variables RegExpr* for assigning
default modifier values.
-=- (+) property ExtSyntaxEnabled changed to 'r'-modifier
(russian extensions - see documentation)
-=- (+) implemented (?#comment) Perl extension - very hard
and usefull work ;)
-=- (^) property MatchCount renamed to SubExprMatchCount.
Sorry for any inconvenients, but it's because new
version works slightly different and if you used
MatchCount in your programms you have to rethink
it ! (see comments to this property)
-=- (+) add InputString property - stores input string
from last Exec call. You may directly assign values
to this property for using in ExecPos method.
-=- (+) add ExecPos method - for working with assigned
to InputString property. You may use it like this
InputString := AString;
ExecPos;
or this
InputString := AString;
ExecPos (AOffset);
Note: ExecPos without parameter works only in
Delphi 4 or higher.
-=- (+) add ExecNext method - simple and fast (!) way to finding
multiple occurences of r.e. in big input string.
-=- (^) Offset parameter removed from Exec method, if you
used it in your programs, please replace all
Exec (AString, AOffset)
with combination
InputString := AString; ExecPos (AOffset)
Sorry for any inconvenients, but old design
(see v.0.81) was too ugly :(
In addition, multiple Exec calls with same input
string produce fool overhead because each Exec
reallocate input string buffer.
-=- (^) optimized implementation of Substitution,
Replace and Split methods
-=- (-) fixed minor bug - if r.e. compilation raise error
during second pass (!!! I think it's impossible
in really practice), TRegExpr stayed in 'compiled'
state.
-=- (-) fixed bug - Dump method didn't check program existance
and raised 'access violation' if previouse Exec
was finished with error.
-=- (+) changed error handling (see functions Error, ErrorMsg,
LastError, property CompilerErrorPos, type ERegExpr).
-=- (-^) TRegExpr.Replace, Split and ExecNext made a infinite
loop in case of r.e. match empty-string.
Now ExecNext moves by MatchLen if MatchLen <> 0
and by +1 if MatchLen = 0
Thanks to Jon Smith and George Tasker for bugreports.
-=- (-) While playing with null-matchs I discovered, that
null-match at tail of input string is never found.
Well, I fixed this, but I am not sure this is safe
(MatchPos[0]=length(AInputString)+1, MatchLen = 0).
Any suggetions are very appreciated.
-=- (^) Demo project and documentation was upgraded
-=- (^) Documentation and this version was published on my home page
v. 0.81 1999.12.25 // Merry Christmas ! :)
-=- added \s (AnySpace) and \S (NotSpace) meta-symbols
- implemented by Stephan Klimek with minor fixes by AVS
-=- added \f, \a and \b chars (translates into FF, BEL, BS)
-=- removed meta-symbols 'ö' & 'Ö' - sorry for any inconvenients
-=- added Match property (== copy (InputStr, MatchPos [Idx], MatchLen [Idx]))
-=- added extra parameter Offset to Exec method
(thanks to Steve Mudford)
v. 0.7 1999.08.22
-=- fixed bug - in some cases the r.e. [^...]
incorrectly processed (as any symbol)
(thanks to Jan Korycan)
-=- Some changes and improvements in TestRExp.dpr
v. 0.6 1999.08.13 // Friday 13 !
-=- changed header of TRegExpr.Substitute
-=- added Split, Replace & appropriate
global wrappers (thanks to Stephan Klimek for suggetions)
v. 0.5 1999.08.12
-=- TRegExpr.Substitute routine added
-=- Some changes and improvements in TestRExp.dpr
-=- Fixed bug in english version of documentation
(Thanks to Jon Buckheit)
v. 0.4 1999.07.20
-=- Fixed bug with parsing of strings longer then 255 bytes
(thanks to Guido Muehlwitz)
-=- Fixed bug in RegMatch - matches only first occurence of r.e.
(thanks to Stephan Klimek)
v. 0.3 1999.06.13
-=- ExecRegExpr function
v. 0.2 1999.06.10
-=- packed into object-pascal class
-=- code slightly rewriten for pascal
-=- now macro correct proceeded in ranges
-=- r.e.ranges syntax extended for russian letters ranges:
à-ÿ - replaced with all small russian letters (Win1251)
À-ß - replaced with all capital russian letters (Win1251)
à-ß - replaced with all russian letters (Win1251)
-=- added macro '\d' (opcode ANYDIGIT) - match any digit
-=- added macro '\D' (opcode NOTDIGIT) - match not digit
-=- added macro '\w' (opcode ANYLETTER) - match any english letter or '_'
-=- added macro '\W' (opcode NOTLETTER) - match not english letter or '_'
(all r.e.syntax extensions may be turned off by flag ExtSyntax)
v. 0.1 1999.06.09
first version, with bugs, without help => must die :(