- try to lower dependencies to avoid lme4 2024-06-17 removal threat
- work around r-lib/roxygen2#1491
- Fix some links.
- Remove LazyData decl.
- Move to tinytest
- Documentation improvements.
- More S3 methods.
- Back-port pyvtreat recommendation code to Rvtreat.
- Control imputation on design_missingness_treatment.
- Share cross-splits in multinomial mode to minimize data leakage.
- Earlier argument checking.
- Default classification target to TRUE.
- Some documentation improvements.
- Move wrapr to Depends.
- Check for same-frame on prepare paths.
- Pipe into ft.
- Rename multinomial arguments to match other methods.
- Move parallel to a suggestion and configure a global option to shut it down (vtreat.allow_parallel).
- Fix multinomial variable restriction.
- Fix FT multinomial columns copy.
- Work on ft serialization environments.
- More tests.
- Improve custom coder flexibility.
- Center/scale improvement from nfultz.
- Documentation updates.
- fit_transform interface.
- fix wrong y-name in multinomial cross frame.
- Some doc updates.
- Some doc updates.
- Adjust license to GPL-2 | GPL-3
- Some doc updates.
- Fix bug in custom coder NA path and let numeric coder center if wanted.
- More tests.
- Fix erroneous Cohen reference in documentation.
- Fancy level and variable names.
- More tests on odd level names (and collisions).
- Remove spline from default value variables.
- Fix ggplot2 vignette dependence issue.
- Make treatment plans pipeable targets.
- Fix design_missingness_treatment() prepare.
- Move to wrapr test runner.
- Add patch_columns_into_frame().
- Fix column name munging.
- Start switching to RUnit tests.
- Make categorical scoring default TRUE.
- logit-ize custom coders.
- spline custom coder.
- Get stricter about row ordering in building custom coders.
- neaten up printing a bit.
- Collar spline and lower its complexity.
- Accept pre-built approximation tables from numeric custom coders.
- Fix .customCodeNum() extrapolation, down-sampling, and remove numeric ".center" option.
- More docs.
- remove _clean suffix.
- Fix non-scalar comparisons (mostly from class vectors).
- add segmented variable calculation.
- add pre-computed split plan.
- bump dependencies.
- Example higher order coders.
- force parent.frame
- add a drop columns option to design_missingness_treatment()
- Doc fixes
- Fix variable naming in multi class case.
- Some doc fixes.
- Start on NaN and Infinity on databases.
- More tests.
- Fix z-split annotation issue.
- Change designTreatmentsZ() defaults.
- Documentation fixes.
- Multiclass treatments.
- Simple treatments.
- Fix CRAN Note on pre 3.5.0 R parallel usage.
- Smooth catP novel levels scoring a bit.
- Fix check error on parallel::getDefaultCluster()
- Fix issue 19.
- Do not use NULL to signal when parallelism is desired.
- Turn data.table merging on by default (not setting this may be a performance regression).
- rqdatatable treatment path.
- better formatting.
- add extracols argument to prepare().
- Improve regexps.
- Translate treatment plans to rquery.
- Minor documentation improvements.
- Improve error messages on argument checking.
- Improve name generation (remove dots).
- Remove dplyr dependence.
- Add data.table as an optional row-binder.
- Declare an R version.
- parallel indicator calculation.
- add optional warning on encountering novel levels in prepare.
- bind options.
- fix deps.
- restore parallel test.
- kWayStratifiedY performance fix from @khotilov Vadim Khotilovich.
- Minor documentation fixes.
- rm data.table (possibly related to issues 2413 or 2418).
- Minor documentation fixes.
- More work on novel values for non-centered custom coders.
- Numeric custom coders.
- Isotone examples.
- Add codeRestriction option to design steps.
- Prepare for custom models.
- Prefer data.table::rbindlist() for assembling frames.
- Add forceSplit mode.
- Add codeRestriction option to prepare().
- make pruneSig an optional argument in prepare, and force by-name access.
- Remove left.op=TRUE from findInterval in "vtreat Rare Levels" vignette (seems to be a new addition to findInterval, so fails CRAN check r-oldrel-windows-ix86+x86_64).
- General improvements in documentation and vignettes.
- Document saving/loading treatment plans.
- Add 'rsq' column to scoreFrame (rsq for numeric targets, pseudo-rsq for categorical targets).
- Fix treatment of constant columns in cross-partitions.
- Switch doCollar default to FALSE.
- Return indicators on designTreatmentsZ.
- Fix extraDegreesOfFreedom calculation.
- Allow repeated application rows in cross frames.
- Remove lsig and csig from scoreFrame.
- Add meanY to treatmentplan.
- Documentation fixes.
- Change catB variables to delta-logit score.
- Fix passing of arguments to parallel prepare, and reduce size of data passed.
- Ensure set of variables consistency in mkCross* methods.
- More parallelization of level significance calculations, and cheaper chi-square test where appropriate.
- Facilities for y-stratified and grouped splitting (useful for unbalanced classes).
- catScaling=TRUE mode uses logistic regression for y-aware variable scaling.
- Fix erroneous bad type warning on date columns.
- Fix bug that caused catB variables to be scored as "insignificant".
- Add test to check for above bug.
- Expose cross validation controls.
- More guards on significance calculations.
- Fix issue of non-significant variables not being scaled.
- Documentation fixes, document variable types, improve vignettes.
- Minor performance fixes on result accumulation.
- Add optional use of dplyr for row binding (of score report frames).
- Calculations of non catX significances are now deterministic
- Catch mis-naming or non-varying y earlier
- Expose buildEvalSets as a public function
- More tests
- Isolated "cross" or "out of sample" frame generation into mkCrossFrameCExperiment() and mkCrossFrameNExperiment()
- Many small bug fixes and corner cases patched
- Simplified out of sample frame generation
- Added "no-Y" treatment option
- Minor documentation fixes
- Fixed bugs in cross-validated mode
- Force out of sample calculations in more situations
- Fix vignette titles
- Fix documentation
- Do not allow small or insufficiently varying data frames (exact conditions in documentation)
- Unified rare level treatment
- Separate treatment of insignificant levels
- Tests confirming compatibility with 'data.table'
- More special case hardening
- Over-fit vignette
- Minor documentation fixes
- First CRAN release