text-generation-webui/extensions/openai/moderations.py

import time
import numpy as np
from numpy.linalg import norm
from extensions.openai.embeddings import get_embeddings


moderations_disabled = False  # when True, skip scoring: every score is 0.0 and nothing is flagged
category_embeddings = None
antonym_embeddings = None
categories = ["sexual", "hate", "harassment", "self-harm", "sexual/minors", "hate/threatening", "violence/graphic", "self-harm/intent", "self-harm/instructions", "harassment/threatening", "violence"]
flag_threshold = 0.5


def get_category_embeddings() -> dict:
    """Return a {category: embedding} dict, computed once and cached at module level."""
    global category_embeddings
    if category_embeddings is None:
        embeddings = get_embeddings(categories).tolist()
        category_embeddings = dict(zip(categories, embeddings))

    return category_embeddings


def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return np.dot(a, b) / (norm(a) * norm(b))


# Scaled dot product; seems to give the most OpenAI-like scores with all-mpnet-base-v2 embeddings
def mod_score(a: np.ndarray, b: np.ndarray) -> float:
    return 2.0 * np.dot(a, b)


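def moderations(input):
    """Score one string or a list of strings against each category; returns an OpenAI-style response dict."""
    global category_embeddings  # the only module-level name reassigned here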
    results = {
        "id": f"modr-{int(time.time()*1e9)}",
        "model": "text-moderation-001",
        "results": [],
    }

    if moderations_disabled:
        results['results'] = [{
            'categories': {C: False for C in categories},
            'category_scores': {C: 0.0 for C in categories},
            'flagged': False,
        }]
        return results

    category_embeddings = get_category_embeddings()

    # input may be a single string or a list of strings
    if isinstance(input, str):
        input = [input]

    for in_str in input:
        for ine in get_embeddings([in_str]):
            category_scores = {C: mod_score(category_embeddings[C], ine) for C in categories}
            category_flags = {C: bool(category_scores[C] > flag_threshold) for C in categories}
            flagged = any(category_flags.values())

            results['results'].append({
                'flagged': flagged,
                'categories': category_flags,
                'category_scores': category_scores,
            })

    print(results)

    return results
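
The scoring step above can be tried in isolation with a small sketch. It uses hypothetical 2-D vectors in place of real embeddings and plain Python lists instead of NumPy, so it runs without the extension or an embedding model. It also assumes the embeddings are unit length, which this file does not confirm for `get_embeddings`; under that assumption `mod_score` is twice the cosine similarity, so a `flag_threshold` of 0.5 flags a category when the cosine similarity exceeds 0.25.

```python
import math

FLAG_THRESHOLD = 0.5  # same value as flag_threshold in the module above


def normalize(v):
    """Scale a vector to unit length (assumption: real embeddings are normalized)."""
    length = math.sqrt(sum(x * x for x in v))
    return [x / length for x in v]


def mod_score(a, b):
    """Twice the dot product, mirroring mod_score above but on plain lists."""
    return 2.0 * sum(x * y for x, y in zip(a, b))


# Hypothetical toy "embeddings"; real ones would come from get_embeddings().
toy_category_embeddings = {
    "violence": normalize([1.0, 0.0]),
    "hate": normalize([0.0, 1.0]),
}
toy_input_embedding = normalize([1.0, 0.2])  # points mostly along "violence"

# Same three steps as the inner loop of moderations(): score, threshold, combine.
scores = {c: mod_score(e, toy_input_embedding) for c, e in toy_category_embeddings.items()}
flags = {c: s > FLAG_THRESHOLD for c, s in scores.items()}
flagged = any(flags.values())

print(scores, flags, flagged)
```

With these toy vectors only the "violence" category crosses the threshold, and that is enough to set `flagged`, because a single flagged category flags the whole input.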