The first day of the project started with a focus on getting TF-IDF set up and connecting the data to the format that scikit-learn expects. Also, a brief diversion to start with small data before going to the big data approach.
Articles tagged with Python
Project Concept: Supercharging Evernote Building
I'll admit it. I'm a document hoarder. For years I've been collecting and tagging notes in Evernote as I'll read through them to save for later. Can I use that hoarding to learn better?
FormaK Week in Review 2023-01-13 Building
It's been a little while since I last wrote about Formak ( FormaK Under The Hood: Optimization for scikit-learn integration , Sat 08 October 2022). Since then, I've been busy adding functionality to FormaK and I've also been improving the tooling for the project, inspired by Boring Python: code quality. The latest piece of functionality is the C++ code generation. The PR is in progress, but the initial experiments have proved out that the generation pipeline is feasible and able to be integrated into bazel.
FormaK Under The Hood: Optimization for scikit-learn integration Building
For the new scikit-learn feature, let's dive into how the model gets fit to data.
FormaK Coming Soon: Scikt-Learn Integration Building
There's a new feature coming to Formak: integration with sckit-learn. This enables quickly integrating new features such as model selection, pipelines and other data tooling.
FormaK: Python Code Generation Building
The second feature for FormaK landed: generating Python models
Taking Notes in Jupyter Notebooks Building
Taking notes in Markdown (LaTeX?) and Python