mirror of
https://github.com/fhswf/aki_prj23_transparenzregister.git
synced 2025-06-22 05:13:55 +02:00
aki_prj23_transparenzregister
Contributions
See CONTRIBUTING.md for how code should be formatted and which rules we have set for ourselves.
Available entrypoints
The project currently provides the following entrypoints:
- data-transformation > Transfers all the data from the mongodb into the sql db to make it available as production data.
- reset-sql > Resets all sql tables in the connected db.
- copy-sql > Copies the content of a db to another db.
- webserver > Starts the webserver showing the analysis results.
DB Connection settings
To connect to the SQL db, see sql/connector.py. To connect to the Mongo db, see [connect].
Create a secrets.json in the root of this repo with the following structure (values to be replaced by the desired config).
The sqlite entry is an alternative to the postgres section.
{
    "sqlite": "path-to-sqlite.db",
    "postgres": {
        "username": "username",
        "password": "password",
        "host": "localhost",
        "database": "db-name",
        "port": 5432
    },
    "mongo": {
        "username": "username",
        "password": "password",
        "host": "localhost",
        "database": "transparenzregister",
        "port": 27017
    }
}
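To illustrate how the sqlite and postgres entries act as alternatives, here is a minimal sketch that turns a parsed secrets.json into a SQLAlchemy-style connection URL. The helper name `build_sql_url` is hypothetical and not part of this project. Letting the sqlite entry take precedence when both are present is an assumption. The actual logic lives in sql/connector.py.

```python
import json
from urllib.parse import quote_plus


def build_sql_url(config: dict) -> str:
    """Build a SQLAlchemy-style URL from a parsed secrets.json (hypothetical helper)."""
    # Assumption: a configured sqlite path wins over the postgres section.
    if "sqlite" in config:
        return f"sqlite:///{config['sqlite']}"
    if "postgres" in config:
        pg = config["postgres"]
        # Percent-encode credentials so special characters survive in the URL.
        user = quote_plus(pg["username"])
        password = quote_plus(pg["password"])
        return f"postgresql://{user}:{password}@{pg['host']}:{pg['port']}/{pg['database']}"
    raise KeyError("secrets.json needs a 'sqlite' or a 'postgres' entry")


# Example with only the sqlite entry set.
example = json.loads('{"sqlite": "path-to-sqlite.db"}')
print(build_sql_url(example))  # sqlite:///path-to-sqlite.db
```

In practice the dictionary would come from `json.load` on the secrets.json file in the repository root.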
Alternatively, the secrets can be provided as environment variables. One option is to add a .env file with the following layout:
PYTHON_POSTGRES_USERNAME=postgres
PYTHON_POSTGRES_PASSWORD=postgres
PYTHON_POSTGRES_HOST=localhost
PYTHON_POSTGRES_DATABASE=postgres
PYTHON_POSTGRES_PORT=5432
PYTHON_MONGO_USERNAME=username
PYTHON_MONGO_HOST=localhost
PYTHON_MONGO_PASSWORD=password
PYTHON_MONGO_PORT=27017
PYTHON_MONGO_DATABASE=transparenzregister
PYTHON_SQLITE_PATH=PathToSQLite3.db # A path to an SQLite db that overrides the postgres settings
PYTHON_DASH_LOGIN_USERNAME=some-login-to-webgui
PYTHON_DASH_LOGIN_PW=some-pw-to-login-to-webgui
CR=ghcr.io/fhswf/aki_prj23_transparenzregister
TAG=latest
HTTP_PORT=80
The prefix PYTHON_ can be customized by setting a different prefix when constructing the ConfigProvider.
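The prefix convention can be pictured with the following minimal sketch, which collects all variables of one section (for example MONGO) under a configurable prefix. The function `read_prefixed_settings` is a hypothetical stand-in for illustration. It is not the project's ConfigProvider, whose constructor signature is not shown here.

```python
import os
from collections.abc import Mapping
from typing import Optional


def read_prefixed_settings(
    section: str,
    prefix: str = "PYTHON_",
    env: Optional[Mapping] = None,
) -> dict:
    """Collect variables named <prefix><section>_<KEY> into a dict keyed by lowercase key."""
    env = os.environ if env is None else env
    start = f"{prefix}{section}_"
    # Keep only variables that begin with the full prefix and strip it off.
    return {
        name[len(start):].lower(): value
        for name, value in env.items()
        if name.startswith(start)
    }


# Example with an explicit mapping instead of the real environment.
sample_env = {
    "PYTHON_MONGO_HOST": "localhost",
    "PYTHON_MONGO_PORT": "27017",
    "PYTHON_POSTGRES_HOST": "localhost",
}
print(read_prefixed_settings("MONGO", env=sample_env))
# {'host': 'localhost', 'port': '27017'}
```

Values read from the environment are strings, so a port such as 27017 would still need to be converted to an integer before use.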