2011-07-25 9 views
5

Da un datastore vuoto, sono stato in grado di generare automaticamente un file bulkloader.yaml. Contiene solo python_preamble, ma la sezione transformers era vuota.Dove vengono trasformate le pagine di riferimento del bulkloader di Google App Engine?

python_preamble: 
- import: google.appengine.ext.bulkload.transform 
- import: google.appengine.ext.bulkload.bulkloader_wizard 
- import: my_own_transformers 
- import: data_models # This is where the SomeData class is defined. 
# some more imports here 

Poi in base agli esempi nella documentazione, ho bisogno di definire una mappa delle proprietà per ciascuna delle colonne del mio CSV:

transformers: 
- kind: SomeData 
    connector: csv 
    property_map: 
    - property: date 
     import_transform: transform.some_undocumented_function 

due domande:

mia comprensione è che la funzione definita come import_transform trasformerà la normale stringa CSV in un valore Property Class degno del datastore. Voglio capire come funzionano le trasformazioni, quindi penso di avere due alternative.

  1. Dove si trova il riferimento di libreria per google.appengine.ext.bulkload.transform? Voglio sapere come utilizzare transform.some_undocumented_function, così come tutti gli altri transform.some_other_undocumented_transformers

  2. Si può vedere dalla mia python_preamble che ho - import: my_own_transformers. In quel modulo, ho definito una funzione transform_date che accetta una stringa di data ISO come 2001-01-01 e la trasforma in un tipo che può rientrare in db.DateProperty(). Se il mio concetto è corretto, posso usare:

 
property_map: 
    - property: date 
    import_transform: my_own_transforms.transform_date 
+0

Non trasformare i dati in classi di proprietà o anche istanze di classi di proprietà. Le classi di proprietà sono un dettaglio di come 'db.Model 'denota i modelli; i tipi di dati che possono memorizzare sono completamente separati. –

+0

@Nick, penso che tu intenda "db.Model" è un esempio di [Classe del modello] (http://code.google.com/appengine/docs/python/datastore/datamodeling.html#Model_Classes), una sottoclasse di cui ho chiamato 'SomeData' nella mia domanda, e un' db.DateProperty() 'è un esempio di una [Property Class] (http://code.google.com/appengine/docs/python/datastore/typesandpropertyclasses. html). Questo è quello che intendevo anche io. Non sono sicuro di dove sono incappato nella mia spiegazione. – Kit

+0

@Nick, penso che avrei dovuto dire "trasformare la stringa CSV in un tipo di dati richiesto da un'istanza della classe di proprietà di' SomeData' ". È corretto? – Kit

risposta

7

1)
È possibile controllare la source code, o che il qualcosa di console interattiva di simile:

from google.appengine.ext.bulkload import transform 
help(transform) 

si vuole get:

Help on module google.appengine.ext.bulkload.transform in google.appengine.ext.bulkload: 

NAME 
    google.appengine.ext.bulkload.transform - Bulkloader Transform Helper functions. 

FILE 
    /Applications/GoogleAppEngineLauncher.app/Contents/Resources/GoogleAppEngine-default.bundle/Contents/Resources/google_appengine/google/appengine/ext/bulkload/transform.py 

DESCRIPTION 
    A collection of helper functions for bulkloading data, typically referenced 
    from a bulkloader.yaml file. 

FUNCTIONS 
    blob_to_file(filename_hint_propertyname=None, directory_hint='') 
     Write the blob contents to a file, and replace them with the filename. 

     Args: 
      filename_hint_propertyname: If present, the filename will begin with 
      the contents of this value in the entity being exported. 
      directory_hint: If present, the files will be stored in this directory. 

     Returns: 
      A function which writes the input blob to a file. 

    blobproperty_from_base64 = wrapper(value) 

    bytestring_from_base64 = wrapper(value) 

    child_node_from_list(child_node_name) 
     Return a value suitable for generating an XML child node on export. 

     The return value is a list of tuples which the simplexml connector will 
     use to build a child node. 

     See also list_from_child_node 

     Args: 
      child_node_name: The name to use for each child node. 

     Returns: 
      Transform function which works as described in the args. 

    create_deep_key(*path_info) 
     A method to make multi-level Key objects. 

     Generates multi-level key from multiple fields in the input dictionary. 

     This is typically used for Keys for entities which have variable parent keys, 
     e.g. ones with owned relationships. It can used for both __key__ and 
     references. 

     Use create_foreign_key as a simpler way to create single level keys. 

     Args: 
      path_info: List of tuples, describing (kind, property, is_id=False). 
      kind: The kind name. 
      property: The external property in the current import dictionary, or 
       transform.CURRENT_PROPERTY for the value passed to the transform. 
      is_id: Converts value to int and treats as numeric ID if True, otherwise 
       the value is a string name. Default is False. 
      Example: 
       create_deep_key(('rootkind', 'rootcolumn'), 
           ('childkind', 'childcolumn', True), 
           ('leafkind', transform.CURRENT_PROPERTY)) 

     Returns: 
      Transform method which parses the info from the current neutral dictionary 
      into a Key with parents as described by path_info. 

    create_foreign_key(kind, key_is_id=False) 
     A method to make one-level Key objects. 

     These are typically used in ReferenceProperty in Python, where the reference 
     value is a key with kind (or model) name name. 

     This helper method does not support keys with parents. Use create_deep_key 
     instead to create keys with parents. 

     Args: 
      kind: The kind name of the reference as a string. 
      key_is_id: If true, convert the key into an integer to be used as an id. 
      If false, leave the key in the input format (typically a string). 

     Returns: 
      Single argument method which parses a value into a Key of kind entity_kind. 

    empty_if_none(fn) 
     A wrapper for a value to return '' if it's None. Useful on export. 

     Can be used in config files (e.g. "transform.empty_if_none(unicode)" or 
     as a decorator. 

     Args: 
      fn: Single argument transform function. 

     Returns: 
      Wrapped function. 

    export_date_time(format) 
     A wrapper around strftime. Also returns '' if the input is None. 

     Args: 
      format: Format string for strftime. 

     Returns: 
      Single argument method which convers a datetime into a string using format. 

    import_date_time(format, _strptime=None) 
     A wrapper around strptime. Also returns None if the input is empty. 

     Args: 
      format: Format string for strptime. 

     Returns: 
      Single argument method which parses a string into a datetime using format. 

    join_list(delimeter) 
     Join a list into a string using the delimeter. 

     This is just a wrapper for string.join. 

     Args: 
      delimeter: The delimiter to use when joining the string. 

     Returns: 
      Method which joins the list into a string with the delimeter. 

    key_id_or_name_as_string = transform_function(key) 

    key_id_or_name_as_string_n(index) 
     Pull out the nth (0-based) key id or name from a key which has parents. 

     If a key is present, return its id or name as a string. 

     Note that this loses the distinction between integer IDs and strings 
     which happen to look like integers. Use key_type to distinguish them. 

     This is a useful complement to create_deep_key. 

     Args: 
      index: The depth of the id or name to extract. Zero is the root key. 
       Negative one is the leaf key. 

     Returns: 
      Function extracting the name or ID of the key at depth index, as a unicode 
      string. Returns '' if key is empty (unsaved), otherwise raises IndexError 
      if the key is not as deep as described. 

    key_kind = wrapper(value) 

    key_kind_n(index) 
     Pull out the nth (0-based) key kind from a key which has parents. 

     This is a useful complement to create_deep_key. 

     Args: 
      index: The depth of the id or name to extract. Zero is the root key. 
      Negative one is the leaf key. 

     Returns: 
      Function returning the kind of the key at depth index, or raising 
      IndexError if the key is not as deep as described. 

    key_type = transform_function(key) 

    key_type_n(index) 
     Pull out the nth (0-based) key type from a key which has parents. 

     This is most useful when paired with key_id_or_name_as_string_n. 
     This is a useful complement to create_deep_key. 

     Args: 
      index: The depth of the id or name to extract. Zero is the root key. 
       Negative one is the leaf key. 

     Returns: 
      Method returning the type ('ID' or 'name') of the key at depth index. 
      Returns '' if key is empty (unsaved), otherwise raises IndexError 
      if the key is not as deep as described. 

    list_from_child_node(xpath, suppress_blank=False) 
     Return a list property from child nodes of the current xml node. 

     This applies only the simplexml helper, as it assumes __node__, the current 
     ElementTree node corresponding to the import record. 

     Sample usage for structure: 
     <Visit> 
      <VisitActivities> 
      <Activity>A1</Activity> 
      <Activity>A2</Activity> 
      </VisitActivities> 
     </Visit> 

     property: activities 
     external_name: VisitActivities # Ignored on import, used on export. 
     import_transform: list_from_xml_node('VisitActivities/Activity') 
     export_transform: child_node_from_list('Activity') 

     Args: 
      xpath: XPath to run on the current node. 
      suppress_blank: if True, ndoes with no text will be skipped. 

     Returns: 
      Transform function which works as described in the args. 

    list_from_multiproperty(*external_names) 
     Create a list from multiple properties. 

     Args: 
      external_names: List of the properties to use. 

     Returns: 
      Transform function which returns a list of the properties in external_names. 

    none_if_empty(fn) 
     A decorator which returns None if its input is empty else fn(x). 

     Useful on import. Can be used in config files 
     (e.g. "transform.none_if_empty(int)" or as a decorator. 

     Args: 
      fn: Single argument transform function. 

     Returns: 
      Wrapped function. 

    property_from_list(index) 
     Return the Nth item from a list, or '' if the list is shorter. 

     Args: 
      index: Item in the list to return. 

     Returns: 
      Function returning the item from a list, or '' if the list is too short. 

    regexp_bool(regexp, flags=0) 
     Return a boolean if the expression matches with re.match. 

     Note that re.match anchors at the start but not end of the string. 

     Args: 
      regexp: String, regular expression. 
      flags: Optional flags to pass to re.match. 

     Returns: 
      Method which returns a Boolean if the expression matches. 

    regexp_extract(pattern, method=<function match at 0x336270>, group=1) 
     Return first group in the value matching the pattern using re.match. 

     Args: 
      pattern: A regular expression to match on with at least one group. 
      method: The method to use for matching; normally re.match or re.search. 
      group: The group to use for extracting a value. 

     Returns: 
      A single argument method which returns the group_arg group matched, 
      or None if no match was found or the input was empty. 

    regexp_to_list(pattern) 
     Return function that returns a list of objects that match the regex. 

     Useful on import. Uses the provided regex to split a string value into a list 
     of strings. Wrapped by none_if_input_or_result_empty, so returns none if 
     there are no matches for the regex and none if the input is empty. 

     Args: 
      pattern: A regular expression pattern to match against the input string. 

     Returns: 
      None if the input was none or no matches were found, otherwise a list of 
      strings matching the input expression. 

    split_string(delimeter) 
     Split a string using the delimeter into a list. 

     This is just a wrapper for string.split. 

     Args: 
      delimeter: The delimiter to split the string on. 

     Returns: 
      Method which splits the string into a list along the delimeter. 

DATA 
    CURRENT_PROPERTY = None 
    KEY_TYPE_ID = 'ID' 
    KEY_TYPE_NAME = 'name' 
    __loader__ = <google.appengine.tools.dev_appserver.HardenedModulesHook... 

2)
Esattamente, è possibile utilizzare le funzioni di trasformazione definite o, in questo caso specifico, è possibile utilizzare direttamente transform.import_date_time.

import_date_time(format, _strptime=None) 
    A wrapper around strptime. Also returns None if the input is empty. 

    Args: 
    format: Format string for strptime. 

    Returns: 
    Single argument method which parses a string into a datetime using format.