github-actions[bot] 7620efc6da build(deps): bump rapidfuzz from 3.4.0 to 3.5.1 (#298)
Bumps [rapidfuzz](https://github.com/maxbachmann/RapidFuzz) from 3.4.0
to 3.5.1.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a
href="https://github.com/maxbachmann/RapidFuzz/releases">rapidfuzz's
releases</a>.</em></p>
<blockquote>
<h2>Release 3.5.1</h2>
<h2>Fixed</h2>
<ul>
<li>fix compilation failure on macOS</li>
</ul>
<h2>Release 3.5.0</h2>
<h2>Changed</h2>
<ul>
<li>skip pandas <code>pd.NA</code> similar to <code>None</code></li>
<li>add <code>score_multiplier</code> argument to
<code>process.cdist</code> which allows multiplying the end result
scores
with a constant factor.</li>
<li>drop support for Python 3.7</li>
</ul>
<h2>Performance</h2>
<ul>
<li>improve performance of simd implementation for <code>LCS</code> /
<code>Indel</code> / <code>Jaro</code> / <code>JaroWinkler</code></li>
<li>improve performance of Jaro and Jaro Winkler for long sequences</li>
<li>implement <code>process.extract</code> with <code>limit=1</code>
using <code>process.extractOne</code> which can be faster</li>
</ul>
<h2>Fixed</h2>
<ul>
<li>the preprocessing function was always called through Python due to a
broken C-API version check</li>
<li>fix wraparound issue in simd implementation of Jaro and Jaro
Winkler</li>
</ul>
</blockquote>
</details>
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/maxbachmann/RapidFuzz/blob/main/CHANGELOG.rst">rapidfuzz's
changelog</a>.</em></p>
<blockquote>
<h2>Changelog</h2>
<p>[3.5.1] - 2023-10-31
^^^^^^^^^^^^^^^^^^^^
Fixed</p>
<pre><code>* fix compilation failure on macOS
<p>[3.5.0] - 2023-10-31
^^^^^^^^^^^^^^^^^^^^
Changed
</code></pre></p>
<ul>
<li>skip pandas <code>pd.NA</code> similar to <code>None</code></li>
<li>add <code>score_multiplier</code> argument to
<code>process.cdist</code> which allows multiplying the end result
scores
with a constant factor.</li>
<li>drop support for Python 3.7</li>
</ul>
<p>Performance</p>
<pre><code>* improve performance of simd implementation for ``LCS`` /
``Indel`` / ``Jaro`` / ``JaroWinkler``
* improve performance of Jaro and Jaro Winkler for long sequences
* implement ``process.extract`` with ``limit=1`` using
``process.extractOne`` which can be faster
<p>Fixed</p>
<pre><code>* the preprocessing function was always called through Python
due to a broken C-API version check
* fix wraparound issue in simd implementation of Jaro and Jaro Winkler

[3.4.0] - 2023-10-09
^^^^^^^^^^^^^^^^^^^^
Changed
</code></pre>
<ul>
<li>upgrade to <code>Cython==3.0.3</code></li>
<li>add simd implementation for Jaro and Jaro Winkler</li>
</ul>
<p>[3.3.1] - 2023-09-25
^^^^^^^^^^^^^^^^^^^^
Added</p>
<pre><code>* add missing tag for python 3.12 support

[3.3.0] - 2023-09-11
^^^^^^^^^^^^^^^^^^^^
Changed
</code></pre>
<ul>
<li>upgrade to <code>Cython==3.0.2</code></li>
<li>implement the remaining missing features from the C++ implementation
in the pure Python implementation</li>
</ul>
<p>Added
&lt;/tr&gt;&lt;/table&gt;
</code></pre></p>
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="5b35b11ead"><code>5b35b11</code></a>
release 3.5.1</li>
<li><a
href="f2ec8c1a2d"><code>f2ec8c1</code></a>
fix compilation failure on macOS</li>
<li><a
href="c1f0d0d1f6"><code>c1f0d0d</code></a>
fix wraparound issue in Jaro/JaroWinkler</li>
<li><a
href="ea6962ad00"><code>ea6962a</code></a>
update changelog</li>
<li><a
href="d089e1fe92"><code>d089e1f</code></a>
fetch tags in submodules</li>
<li><a
href="0e39abdff7"><code>0e39abd</code></a>
update python version</li>
<li><a
href="438cb51e9f"><code>438cb51</code></a>
update rapidfuzz-cpp</li>
<li><a
href="c68752cc9f"><code>c68752c</code></a>
add coverage badge</li>
<li><a
href="cc1e9a11ea"><code>cc1e9a1</code></a>
test preprocessing function with distance scorers</li>
<li><a
href="41dfce987a"><code>41dfce9</code></a>
improve coverage for python preprocessing function</li>
<li>Additional commits viewable in <a
href="https://github.com/maxbachmann/RapidFuzz/compare/v3.4.0...v3.5.1">compare
view</a></li>
</ul>
</details>
<br />


[![Dependabot compatibility
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=rapidfuzz&package-manager=pip&previous-version=3.4.0&new-version=3.5.1)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)

Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.

[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)


</details>
2023-11-01 17:35:03 +00:00
2023-10-26 19:06:45 +02:00
2023-10-21 10:58:54 +02:00
2023-10-20 16:36:54 +02:00

aki_prj23_transparenzregister

python Actions status Ruff pre-commit Checked with mypy Documentation Status Code style: black

Contributions

See the CONTRIBUTING.md about how code should be formatted and what kind of rules we set ourselves.

Available entrypoints

The project has currently the following entrypoint available:

  • data-transformation > Transfers all the data from the mongodb into the sql db to make it available as production data.
  • reset-sql > Resets all sql tables in the connected db.
  • copy-sql > Copys the content of a db to another db.
  • webserver > Starts the webserver showing the analysis results.

DB Connection settings

To connect to the SQL db see sql/connector.py To connect to the Mongo db see [connect]

Create a secrets.json in the root of this repo with the following structure (values to be replaces by desired config):

The sqlite db is alternative to the postgres section.

{
  "sqlite": "path-to-sqlite.db",
  "postgres": {               
    "username": "username",      
    "password": "password",
    "host": "localhost",
    "database": "db-name",
    "port": 5432
  },
  "mongo": {
    "username": "username",
    "password": "password",
    "host": "localhost",
    "database": "transparenzregister",
    "port": 27017
  }
}

Alternatively, the secrets can be provided as environment variables. One option to do so is to add a .env file with the following layout:

PYTHON_POSTGRES_USERNAME=postgres
PYTHON_POSTGRES_PASSWORD=postgres
PYTHON_POSTGRES_HOST=localhost
PYTHON_POSTGRES_DATABASE=postgres
PYTHON_POSTGRES_PORT=5432

PYTHON_MONGO_USERNAME=username
PYTHON_MONGO_HOST=localhost
PYTHON_MONGO_PASSWORD=password
PYTHON_MONGO_PORT=27017
PYTHON_MONGO_DATABASE=transparenzregister

PYTHON_SQLITE_PATH=PathToSQLite3.db # An overwrite path to an sqllite db

PYTHON_DASH_LOGIN_USERNAME=some-login-to-webgui
PYTHON_DASH_LOGIN_PW=some-pw-to-login-to-webgui

CR=ghcr.io/fhswf/aki_prj23_transparenzregister
TAG=latest

HTTP_PORT=80

The prefix PYTHON_ can be customized by setting a different prefix when constructing the ConfigProvider.

Description
No description provided
Readme 138 MiB
Languages
Jupyter Notebook 84.5%
HTML 12.1%
Python 3.3%