Redressing #Bias: "Correlation Constraints for Regression Models":
Treder et al (2021) https://doi.org/10.3389/fpsyt.2021.615754
"Feature importance helps in understanding which features contribute most to the prediction"
A few lines with #sklearn: https://mljourney.com/sklearn-linear-regression-feature-importance/
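A minimal sketch of the idea (not taken from the linked article): if the features are standardized to a common scale, the absolute values of `LinearRegression` coefficients can serve as a rough importance ranking. The synthetic data below is just for illustration.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

# Synthetic regression problem where only 2 of 5 features matter;
# coef=True returns the true generating coefficients for reference.
X, y, true_coef = make_regression(
    n_samples=200, n_features=5, n_informative=2, coef=True, random_state=0
)

# Standardize so coefficient magnitudes are comparable across features.
X_scaled = StandardScaler().fit_transform(X)
model = LinearRegression().fit(X_scaled, y)

importance = np.abs(model.coef_)
ranking = np.argsort(importance)[::-1]  # feature indices, most important first
print("features ranked by |coefficient|:", ranking)
```

Without the standardization step, a coefficient's size mixes the feature's effect with its unit, so the ranking would be misleading.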
#Lasso #LinearRegression "is useful in some contexts due to its tendency to prefer solutions with fewer non-zero coefficients, effectively reducing the number of features upon which the given solution is dependent"
https://scikit-learn.org/stable/modules/linear_model.html#lasso
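A small illustration of that sparsity on synthetic data (a sketch, not from the linked docs): with only a few informative features, Lasso drives many coefficients to exactly zero.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso

# 20 features, only 3 of which actually influence y.
X, y = make_regression(
    n_samples=100, n_features=20, n_informative=3, noise=1.0, random_state=0
)

lasso = Lasso(alpha=1.0).fit(X, y)
n_nonzero = np.count_nonzero(lasso.coef_)
print(f"{n_nonzero} of {lasso.coef_.size} coefficients are non-zero")
```

Larger `alpha` values push more coefficients to zero; `LassoCV` can pick `alpha` by cross-validation.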
I'm playing with the California Housing dataset built into sklearn.
One census block group has an average number of bedrooms per household of 0.83 and an average number of household members of 1243.
Huh?
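One quick way to surface rows like that is a simple threshold filter. The frame below is a tiny hand-made stand-in (the second row mimics that block group); for the real data you would load `sklearn.datasets.fetch_california_housing(as_frame=True)` and filter its frame the same way. The threshold of 50 is an arbitrary choice for illustration.

```python
import pandas as pd

# Stand-in for the California Housing frame; real column names are kept.
df = pd.DataFrame({
    "AveBedrms": [1.02, 0.83, 0.97],
    "AveOccup":  [2.5, 1243.3, 3.1],  # second row mimics the odd block group
})

# Flag block groups with implausible average occupancy.
suspicious = df[df["AveOccup"] > 50]
print(suspicious)
```

Extreme `AveOccup` values in this dataset are usually worth inspecting before modeling, since a handful of such rows can dominate a linear fit.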
I just did my first project using the #mlflow library to track metrics across iterations of manually tuning an #sklearn pipeline. It works great and gives me some idea of the search space before moving on to automated hyperparameter tuning.
I'm using it in a super basic way, as an alternative to creating a gazillion cells with comments tracking metrics. Does anyone have any favorite features to check out for taking mlflow to the next level?
#machinelearning #python #MLOps #scikitlearn
Uhm... if I get a decision tree like the one shown in the picture, does that mean I only need the columns that appear in the tree for training and validation? That is, I would only need columns 2 and 3 (x[2], x[3]), right? Or am I missing something?
#LinearRegression #Python #Sklearn
Dive into predictive modeling with our comprehensive guide on linear regression using Python and sklearn. Learn step-by-step implementation, result interpretation, and data visualization techniques. Perfect for beginners.
https://teguhteja.id/mastering-linear-regression-with-python-and-sklearn-a-step-by-step-guide/
When training a model, it turns out that I get better results with a small dataset than with a bigger one. Is this what's called overfitting?
#MachineLearning #Sklearn
Dear Machine Learning people: when a problem can be solved with both a regressor and a classifier, which would you choose? Or do you simply try both and pick whichever works better? Is there any rule, or set of rules, for determining which method should work better?
In my job as a data analyst, I come across many different types of problems to solve. Some are relatively easy, others not so much. Recently, though, I ran into a problem I had never given much thought to before.
What is the problem? Finding multiple peaks in a dataset.
You might think, this sounds […]
https://jrashford.com/2024/03/25/finding-peaks-in-a-dataset-and-why-it-is-not-straightforward/
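A common starting point (a sketch, not necessarily the approach taken in the linked post) is `scipy.signal.find_peaks`, which lets you filter candidate peaks by prominence so small noise bumps don't count.

```python
import numpy as np
from scipy.signal import find_peaks

# A signal with two bumps of different heights.
x = np.linspace(0, 10, 500)
y = np.exp(-((x - 3) ** 2)) + 0.5 * np.exp(-((x - 7) ** 2))

# Keep only peaks that stand out by at least 0.1 from surrounding valleys.
peak_idx, props = find_peaks(y, prominence=0.1)
print("peaks at x =", x[peak_idx])
```

The hard part in practice, as the post suggests, is choosing thresholds like `prominence`, `height`, or `distance` so that noise is excluded without dropping genuine peaks.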
anyone know of a FOSS lib equiv to Python's Scikit-learn (sklearn) but in/for Go?
(and to forestall an obvious suggestion that's likely a non-starter for my needs: yes, I'm aware of the idea of wrapping it or otherwise linking out to it from Go. That's my worst-case fallback, but I'm avoiding it. The ideal is a 100% pure Go source-to-binary solution)
#Golang
#Python
#sklearn
#ScikitLearn
#ML
#stats
#statistics
#math
#FOSS
Registration is now open for our February meetup: Operational efficiency with LLMs and scikit-learn pipelines, this month at the Adyen offices
https://www.meetup.com/pydata-madrid/events/299189759/
See you Thursday the 22nd at 19:00! Networking afterwards
@buck The feature has landed! FormaK now supports hyper-parameter selection and cross validation with a new structured state machine interface. Under the hood it’s using scikit-learn. As always, it can be built into a #Python or #Cpp model or #KalmanFilter
Discover scikit-learn 1.4 and its: 5 major features & 13 features
14 efficiency improvements & 23 enhancements
15 API changes
38 fixes
More details in the changelog: https://bit.ly/3tWlZA3
or in the release highlights: https://bit.ly/3Hsoddm
You can upgrade with pip as usual:
pip install -U scikit-learn
Or using the conda-forge builds:
conda install -c conda-forge scikit-learn
Thanks again to all 80+ contributors!
I ran a quick Gradient Boosted Trees vs Neural Nets check using scikit-learn's dev branch, which makes it more convenient to work with tabular datasets that mix numerical and categorical features (e.g. the Adult Census dataset).
Let's start with the GBRT model. It's now possible to reproduce the SOTA number for this dataset in a few lines of code, in 2 s (CV included) on my laptop.
1/n
I’m starting a new feature for @formak: semi-automated hyper-parameter selection for models and Kalman Filters.
You can read the design doc for the feature here: https://github.com/buckbaskin/formak/blob/hyperparameter-selection/docs/designs/hyperparameter_selection.md
Feedback on the design is welcome here or on GitHub
scikit-learn 1.3.1 is out!
This release fixes a bunch of annoying bugs. Here is the changelog:
https://scikit-learn.org/stable/whats_new/v1.3.html#version-1-3-1
Thanks very much to all bug reporters, PR authors and reviewers and thanks in particular to @glemaitre, the release manager of 1.3.1.
I recently went down a rabbit hole while attempting to fix the tests for #sklearn's OLS and Ridge regression solvers.
On the theoretical side, I now understand that the minimum-norm solution of the centered problem without intercept is also the minimum-norm solution of the original problem (with intercept). Ridge/OLS on centered X & y followed by intercept computation is the approach (hereafter named type "a") that we have been using for years.
https://raw.githubusercontent.com/ogrisel/minimum-norm-ols/main/minimum-norm-ols-intercept.pdf