ruby.git - The Ruby Programming Language

Age	Commit message (Collapse)	Author
2024-07-26	Fix memory leak in String#start_with? when regexp times out	Peter Zhu
	[Bug #20653] This commit refactors how Onigmo handles timeout. Instead of raising a timeout error, onig_search will return a ONIGERR_TIMEOUT which the caller can free memory, and then raise a timeout error. This fixes a memory leak in String#start_with when the regexp times out. For example: regex = Regexp.new("^#{"(a)" 10_000}x$", timeout: 0.000001) str = "a" * 1000000 + "x" 10.times do 100.times do str.start_with?(regex) rescue end puts `ps -o rss= -p #{$$}` end Before: 33216 51936 71152 81728 97152 103248 120384 133392 133520 133616 After: 14912 15376 15824 15824 16128 16128 16144 16144 16160 16160 Notes: Merged: https://github.com/ruby/ruby/pull/11247
2024-07-16	Add MatchData#bytebegin and MatchData#byteend	Shugo Maeda
	These methods return the byte-based offset of the beginning or end of the specified match. [Feature #20576]
2024-04-18	Add a hint of `ASCII-8BIT` being `BINARY`	Jean Boussier
	[Feature #18576] Since outright renaming `ASCII-8BIT` is deemed to backward incompatible, the next best thing would be to only change its `#inspect`, particularly in exception messages.
2024-02-02	Fix memory leak in OnigRegion when match raises	Peter Zhu
	[Bug #20228] rb_reg_onig_match can raise a Regexp::TimeoutError, which would cause the OnigRegion to leak.
2024-02-02	Fix memory leak in stk_base when Regexp timeout	Peter Zhu
	[Bug #20228] If rb_reg_check_timeout raises a Regexp::TimeoutError, then the stk_base will leak.
2024-01-07	* expand tabs. [ci skip]	git
	Please consider using misc/expand_tabs.rb as a pre-commit hook.
2024-01-08	Adjust styles and indents [ci skip]	Nobuyoshi Nakada

2024-01-01	Don't create T_MATCH object if /regexp/.match(string) doesn't match	Luke Gruber
	Fixes [Bug #20104]
2023-12-24	Fix Regexp#inspect for GC compaction	Peter Zhu
	rb_reg_desc was not safe for GC compaction because it took in the C string and length but not the backing String object so it get moved during compaction. This commit changes rb_reg_desc to use the string from the Regexp object. The test fails when RGENGC_CHECK_MODE is turned on: TestRegexp#test_inspect_under_gc_compact_stress [test/ruby/test_regexp.rb:474]: <"(?-mix:\\/)\|"> expected but was <"/\\x00\\x00\\x00\\x00\\x00\\x00\\x00\\x00\\x00\\x00/">.
2023-12-24	Fix Regexp#match for GC compaction	Peter Zhu
	The test fails when RGENGC_CHECK_MODE is turned on: TestRegexp#test_match_under_gc_compact_stress: NoMethodError: undefined method `match' for nil test_regexp.rb:878:in `block in test_match_under_gc_compact_stress'
2023-12-23	Fix Regexp#to_s for GC compaction	Peter Zhu
	The test fails when RGENGC_CHECK_MODE is turned on: TestRegexp#test_to_s_under_gc_compact_stress = 13.46 s 1) Failure: TestRegexp#test_to_s_under_gc_compact_stress [test/ruby/test_regexp.rb:81]: <"(?-mix:abcd\u3042)"> expected but was <"(?-mix:\u5C78\u3030\u5C78\u3030\u5C78\u3030\u5C78\u3030\u5C78\u3030)">.
2023-12-19	[DOC] State MatchData#[] when multiple captures with the same name	Nobuyoshi Nakada

2023-12-14	[DOC] Adjust some new features wording/examples. (#9183)	Victor Shepelev
	* Reword Range#overlap? docs last paragraph. * Docs: add explanation about Queue#freeze * Docs: Add :rescue event docs for TracePoint * Docs: Enhance Module#set_temporary_name documentation * Docs: Slightly expand Process::Status deprecations * Fix MatchData#named_captures rendering glitch * Improve Dir.fchdir examples * Adjust Refinement#target docs
2023-12-06	Copy encoding flags when copying a regex [Bug #20039]	Dustin Brown
	* :bug: Fixes [Bug #20039](https://bugs.ruby-lang.org/issues/20039) When a Regexp is initialized with another Regexp, we simply copy the properties from the original. However, the flags on the original were not being copied correctly. This caused an issue when the original had multibyte characters and was being compared with an ASCII string. Without the forced encoding flag (`KCODE_FIXED`) transferred on to the new Regexp, the comparison would fail. See the included test for an example. Co-authored-by: Nobuyoshi Nakada <[email protected]>
2023-11-14	[DOC] Fix doc/regexp.rdoc links	Nobuyoshi Nakada
	- Rename regexp.rdoc to exclude from "Pages". This file is for to be included in the "class Regexp" document, but it also appeared as a separate page duplicately. - Fix links on case-sensitive filesystems. - Fix to use rdoc-ref instead of converted HTML page names.
2023-10-20	[DOC] Indentation fix in comments of MatchData#inspect	Herwin
	The old version did not add syntax highlighting to the code block, and included the "Related:" line in the code block as well.
2023-10-20	[DOC] Fix typo in docs of Regexp#deconstruct_keys	Herwin
	of => if
2023-07-31	Reuse Regexp ptr when recompiling	Peter Zhu
	When matching an incompatible encoding, the Regexp needs to recompile. If `usecnt == 0`, then we can reuse the `ptr` because nothing else is using it. This avoids allocating another `regex_t`. This speeds up matches that switch to incompatible encodings by 15%. Branch: ``` Regex#match? with different encoding 1.431M (± 1.3%) i/s - 7.264M in 5.076153s Regex#match? with same encoding 16.858M (± 1.1%) i/s - 85.347M in 5.063279s ``` Base: ``` Regex#match? with different encoding 1.248M (± 2.0%) i/s - 6.342M in 5.083151s Regex#match? with same encoding 16.377M (± 1.1%) i/s - 82.519M in 5.039504s ``` Script: ``` regex = /foo/ str1 = "日本語" str2 = "English".force_encoding("ASCII-8BIT") Benchmark.ips do \|x\| x.report("Regex#match? with different encoding") do \|times\| i = 0 while i < times regex.match?(str1) regex.match?(str2) i += 1 end end x.report("Regex#match? with same encoding") do \|times\| i = 0 while i < times regex.match?(str1) i += 1 end end end ```
2023-07-27	Resurrect rb_reg_prepare_re C API	Takashi Kokubun
	Existing strscan releases rely on this C API. It means that the current Ruby master doesn't work if your Gemfile.lock has strscan unless it's locked to 3.0.7, which is not released yet. To fix it, let's not remove the C API we've exposed to users.
2023-07-27	Don't load RREGEXP_PTR twice	Peter Zhu

2023-07-27	Refactor err string in rb_reg_prepare_re	Peter Zhu

2023-07-27	Add function rb_reg_onig_match	Peter Zhu
	rb_reg_onig_match performs preparation, error handling, and cleanup for matching a regex against a string. This reduces repetitive code and removes the need for StringScanner to access internal data of regex. Notes: Merged: https://github.com/ruby/ruby/pull/8123
2023-07-20	Embed struct rmatch into GC slot (#8097)	Kunshan Wang

2023-06-27	Stop allocating unused backref strings at `defined?`	Nobuyoshi Nakada
	Notes: Merged: https://github.com/ruby/ruby/pull/7983
2023-06-27	Use `rb_reg_nth_defined` instead of `rb_match_nth_defined`	Nobuyoshi Nakada
	Notes: Merged: https://github.com/ruby/ruby/pull/7983
2023-06-20	[DOC] Regexp doc (#7923)	Burdette Lamar
	Notes: Merged-By: peterzhu2118 <[email protected]>
2023-06-09	* expand tabs. [ci skip]	git
	Please consider using misc/expand_tabs.rb as a pre-commit hook.
2023-06-09	Optimize `Regexp#dup` and `Regexp.new(/RE/)`	Nobuyoshi Nakada
	When copying from another regexp, copy already built `regex_t` instead of re-compiling its source. Notes: Merged: https://github.com/ruby/ruby/pull/7922
2023-04-23	Use UTF-8 encoding for literal extended regexps with UTF-8 characters in ↵	Jeremy Evans
	comments Fixes [Bug #19455] Notes: Merged: https://github.com/ruby/ruby/pull/7592
2023-04-19	MatchData#named_captures: add optional symbolize_names keyword (#6952)	Vladimir Dementyev
	Notes: Merged-By: ioquatix <[email protected]>
2023-04-06	[Feature #19474] Refactor NEWOBJ macros	Matt Valentine-House
	NEWOBJ_OF is now our canonical newobj macro. It takes an optional ec Notes: Merged: https://github.com/ruby/ruby/pull/7393
2023-03-06	Stop exporting symbols for MJIT	Takashi Kokubun
	Notes: Merged: https://github.com/ruby/ruby/pull/7459
2023-03-06	[DOC] Fix options of `Regexp#initialize`	Nobuyoshi Nakada
	`Integer#\|` is bit-wise OR operator, not logical OR. Notes: Merged: https://github.com/ruby/ruby/pull/7435
2023-03-06	`rb_scan_args` never fills optional arguments with `Qundef`	Nobuyoshi Nakada
	Notes: Merged: https://github.com/ruby/ruby/pull/7435
2023-03-03	[Bug #19471] `Regexp.compile` should handle keyword arguments	Nobuyoshi Nakada
	As well as `Regexp.new`, it should pass keyword arguments to the `Regexp#initialize` method. Notes: Merged: https://github.com/ruby/ruby/pull/7431
2023-03-01	Remove support for the Regexp.new 3rd argument	Jeremy Evans
	This was deprecated in Ruby 3.2. Fixes [Bug #18797] Notes: Merged: https://github.com/ruby/ruby/pull/7039
2023-02-26	Adjust `else` style to be consistent in each files [ci skip]	Nobuyoshi Nakada

2023-02-19	Remove (newly unneeded) remarks about aliases	BurdetteLamar

2023-02-10	Implement Write Barrier for RMatch objects	Jean Boussier
	They only have two references. Notes: Merged: https://github.com/ruby/ruby/pull/7286
2023-02-10	[DOC] Fix typo in document of regexp [ci skip]	OKURA Masafumi
	Notes: Merged: https://github.com/ruby/ruby/pull/7283 Merged-By: nobu <[email protected]>
2023-02-09	Remove `REG_LITERAL` flag	Nobuyoshi Nakada
	All `Regexp` literals are frozen now. Notes: Merged: https://github.com/ruby/ruby/pull/7276
2023-01-30	Fix parsing of regexps that toggle extended mode on/off inside regexp	Jeremy Evans
	This was broken in ec3542229b29ec93062e9d90e877ea29d3c19472. That commit didn't handle cases where extended mode was turned on/off inside the regexp. There are two ways to turn extended mode on/off: ``` /(?-x:#y)#z /x =~ '#y' /(?-x)#y(?x)#z /x =~ '#y' ``` These can be nested inside the same regexp: ``` /(?-x:(?x)#x (?-x)#y)#z /x =~ '#y' ``` As you can probably imagine, this makes handling these regexps somewhat complex. Due to the nesting inside portions of regexps, the unassign_nonascii function needs to be recursive. In recursive mode, it needs to track both opening and closing parentheses, similar to how it already tracked opening and closing brackets for character classes. When scanning the regexp and coming to `(?` not followed by `#`, scan for options, and use `x` and `i` to determine whether to turn on or off extended mode. For `:`, indicting only the current regexp section should have the extended mode switched, recurse with the extended mode set or unset. For `)`, indicating the remainder of the regexp (or current regexp portion if already recursing) should turn extended mode on or off, just change the extended mode flag and keep scanning. While testing this, I noticed that `a`, `d`, and `u` are accepted as options, in addition to `i`, `m`, and `x`, but I can't see where those options are documented. I'm not sure whether or not handling `a`, `d`, and `u` as options is a bug. Fixes [Bug #19379] Notes: Merged: https://github.com/ruby/ruby/pull/7192
2023-01-16	[DOC] Correction to RDoc for Regexp.new (#7130)	Burdette Lamar
	Correction to RDoc for Regexp.new Notes: Merged-By: BurdetteLamar <[email protected]>
2022-12-22	Always issue deprecation warning when calling Regexp.new with 3rd positional ↵	Jeremy Evans
	argument Previously, only certain values of the 3rd argument triggered a deprecation warning. First step for fix for bug #18797. Support for the 3rd argument will be removed after the release of Ruby 3.2. Fix minor fallout discovered by the tests. Co-authored-by: Nobuyoshi Nakada <[email protected]> Notes: Merged: https://github.com/ruby/ruby/pull/6976
2022-12-22	Refactor `reg_extract_args` to return regexp if given	Nobuyoshi Nakada

2022-12-22	Share argument parsing in `Regexp#initialize` and `Regexp.linear_time?`	Nobuyoshi Nakada
	Notes: Merged: https://github.com/ruby/ruby/pull/6988
2022-12-19	typo in doc [ci skip]	卜部昌平

2022-12-19	Note about Regexp.linera_time? [ci skip]	卜部昌平

2022-12-14	Add `Regexp.linear_time?` (#6901)	TSUYUSATO Kitsune
	Notes: Merged-By: makenowjust <[email protected]>
2022-12-02	Introduce encoding check macro	S-H-GAMELINKS
	Notes: Merged: https://github.com/ruby/ruby/pull/6700