github-actions[bot] bae999f1d9
build(deps): bump rapidfuzz from 3.5.1 to 3.5.2 (#321)
Bumps [rapidfuzz](https://github.com/maxbachmann/RapidFuzz) from 3.5.1
to 3.5.2.
<details>
<summary>Release notes</summary>
<p><em>Sourced from <a
href="https://github.com/maxbachmann/RapidFuzz/releases">rapidfuzz's
releases</a>.</em></p>
<blockquote>
<h2>Release 3.5.2</h2>
<h2>Fixed</h2>
<ul>
<li>use _mm_malloc/_mm_free on macOS if aligned_alloc is
unsupported</li>
</ul>
</blockquote>
</details>
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/maxbachmann/RapidFuzz/blob/main/CHANGELOG.rst">rapidfuzz's
changelog</a>.</em></p>
<blockquote>
<h2>Changelog</h2>
<p>[3.5.2] - 2023-11-02
^^^^^^^^^^^^^^^^^^^^
Fixed</p>
<pre><code>* use _mm_malloc/_mm_free on macOS if aligned_alloc is
unsupported
<p>[3.5.1] - 2023-10-31
^^^^^^^^^^^^^^^^^^^^
Fixed
</code></pre></p>
<ul>
<li>fix compilation failure on macOS</li>
</ul>
<p>[3.5.0] - 2023-10-31
^^^^^^^^^^^^^^^^^^^^
Changed</p>
<pre><code>* skip pandas ``pd.NA`` similar to ``None``
* add ``score_multiplier`` argument to ``process.cdist`` which allows
multiplying the end result scores
  with a constant factor.
* drop support for Python 3.7
<p>Performance
</code></pre></p>
<ul>
<li>improve performance of simd implementation for <code>LCS</code> /
<code>Indel</code> / <code>Jaro</code> / <code>JaroWinkler</code></li>
<li>improve performance of Jaro and Jaro Winkler for long sequences</li>
<li>implement <code>process.extract</code> with <code>limit=1</code>
using <code>process.extractOne</code> which can be faster</li>
</ul>
<p>Fixed</p>
<pre><code>* the preprocessing function was always called through Python
due to a broken C-API version check
* fix wraparound issue in simd implementation of Jaro and Jaro Winkler
<p>[3.4.0] - 2023-10-09
^^^^^^^^^^^^^^^^^^^^
Changed
</code></pre></p>
<ul>
<li>upgrade to <code>Cython==3.0.3</code></li>
<li>add simd implementation for Jaro and Jaro Winkler</li>
</ul>
<p>[3.3.1] - 2023-09-25
^^^^^^^^^^^^^^^^^^^^
Added</p>
<pre><code>* add missing tag for python 3.12 support
<p>[3.3.0] - 2023-09-11
^^^^^^^^^^^^^^^^^^^^
&lt;/tr&gt;&lt;/table&gt;
</code></pre></p>
</blockquote>
<p>... (truncated)</p>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="0a9a8f81c9"><code>0a9a8f8</code></a>
use _mm_malloc/_mm_free on macOS if aligned_alloc is unsupported</li>
<li><a
href="2114f513d3"><code>2114f51</code></a>
fix typing</li>
<li>See full diff in <a
href="https://github.com/maxbachmann/RapidFuzz/compare/v3.5.1...v3.5.2">compare
view</a></li>
</ul>
</details>
<br />


[![Dependabot compatibility
score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=rapidfuzz&package-manager=pip&previous-version=3.5.1&new-version=3.5.2)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores)

Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.

[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)


</details>
2023-11-03 20:20:27 +01:00
2023-10-21 10:58:54 +02:00
2023-10-20 16:36:54 +02:00

aki_prj23_transparenzregister

python Actions status Ruff pre-commit Checked with mypy Documentation Status Code style: black

Contributions

See the CONTRIBUTING.md about how code should be formatted and what kind of rules we set ourselves.

Available entrypoints

The project has currently the following entrypoint available:

  • data-transformation > Transfers all the data from the mongodb into the sql db to make it available as production data.
  • reset-sql > Resets all sql tables in the connected db.
  • copy-sql > Copys the content of a db to another db.
  • webserver > Starts the webserver showing the analysis results.

DB Connection settings

To connect to the SQL db see sql/connector.py To connect to the Mongo db see [connect]

Create a secrets.json in the root of this repo with the following structure (values to be replaces by desired config):

The sqlite db is alternative to the postgres section.

{
  "sqlite": "path-to-sqlite.db",
  "postgres": {               
    "username": "username",      
    "password": "password",
    "host": "localhost",
    "database": "db-name",
    "port": 5432
  },
  "mongo": {
    "username": "username",
    "password": "password",
    "host": "localhost",
    "database": "transparenzregister",
    "port": 27017
  }
}

Alternatively, the secrets can be provided as environment variables. One option to do so is to add a .env file with the following layout:

PYTHON_POSTGRES_USERNAME=postgres
PYTHON_POSTGRES_PASSWORD=postgres
PYTHON_POSTGRES_HOST=localhost
PYTHON_POSTGRES_DATABASE=postgres
PYTHON_POSTGRES_PORT=5432

PYTHON_MONGO_USERNAME=username
PYTHON_MONGO_HOST=localhost
PYTHON_MONGO_PASSWORD=password
PYTHON_MONGO_PORT=27017
PYTHON_MONGO_DATABASE=transparenzregister

PYTHON_SQLITE_PATH=PathToSQLite3.db # An overwrite path to an sqllite db

PYTHON_DASH_LOGIN_USERNAME=some-login-to-webgui
PYTHON_DASH_LOGIN_PW=some-pw-to-login-to-webgui

CR=ghcr.io/fhswf/aki_prj23_transparenzregister
TAG=latest

HTTP_PORT=80

The prefix PYTHON_ can be customized by setting a different prefix when constructing the ConfigProvider.

Description
No description provided
Readme 138 MiB
Languages
Jupyter Notebook 84.5%
HTML 12.1%
Python 3.3%