Feature extraction - wrapper around schema.apply_udf #198

FelipeAdachi · 2023-11-21T21:13:02Z

Currently, if the user wants to use Langkit for a Feature Extraction scenario, they would neet to run:

import toxicity
from whylogs.experimental.core.udf_schema import udf_schema
import pandas as pd

df = pd.DataFrame({"prompt": ["I love you", "I hate you"]})
schema = udf_schema()

df_enhanced, _ = schema.apply_udfs(df)

Which unnecessarily exposes the user to whylogs' udf_schema and provides a confusing tuple output.

This PR wraps the code above into a langkit.extract function, so it becomes like this:

import langkit
from langkit import toxicity

df = pd.DataFrame({"prompt": ["I love you", "I hate you"]})
enhanced_df = langkit.extract(data=df)

or, for the row case:

import langkit
from langkit import toxicity

row = {"prompt": "I love you", "response": "I hate you"}
enhanced_row = langkit.extract(data=row)

also:

incidental error handling in hallucination module

langkit/extract.py

… into dev/felipe/extract

jamie256

This looks great @FelipeAdachi, thanks!

langkit/extract.py

felipe207 and others added 4 commits November 21, 2023 17:59

extract feature

afb913c

test cleanup

3d32159

Merge branch 'main' into dev/felipe/extract

cb34cb1

Merge branch 'main' into dev/felipe/extract

6f307a6

jamie256 reviewed Nov 22, 2023

View reviewed changes

langkit/extract.py Outdated Show resolved Hide resolved

FelipeAdachi commented Nov 22, 2023

View reviewed changes

langkit/extract.py Show resolved Hide resolved

felipe207 added 3 commits November 22, 2023 18:01

refactor

5fea352

Merge branch 'dev/felipe/extract' of https://github.com/whylabs/langkit…

d6f6166

… into dev/felipe/extract

raise invalid data type error

b7de70b

jamie256 approved these changes Nov 22, 2023

View reviewed changes

FelipeAdachi mentioned this pull request Nov 23, 2023

Proactive Injection Detection #201

Merged

jamie256 reviewed Nov 27, 2023

View reviewed changes

langkit/extract.py Outdated Show resolved Hide resolved

better error message

bdf69d2

jamie256 approved these changes Nov 27, 2023

View reviewed changes

jamie256 merged commit 23497fa into main Nov 27, 2023

jamie256 deleted the dev/felipe/extract branch November 27, 2023 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature extraction - wrapper around schema.apply_udf #198

Feature extraction - wrapper around schema.apply_udf #198

Uh oh!

FelipeAdachi commented Nov 21, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

jamie256 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Feature extraction - wrapper around schema.apply_udf #198

Feature extraction - wrapper around schema.apply_udf #198

Uh oh!

Conversation

FelipeAdachi commented Nov 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jamie256 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

FelipeAdachi commented Nov 21, 2023 •

edited

Loading