Alpha Software Mobile Development Tools:   Alpha Anywhere    |   Alpha TransForm subscribe to our YouTube Channel  Follow Us on LinkedIn  Follow Us on Twitter  Follow Us on Facebook

Announcement

Collapse

The Alpha Software Forum Participation Guidelines

The Alpha Software Forum is a free forum created for Alpha Software Developer Community to ask for help, exchange ideas, and share solutions. Alpha Software strives to create an environment where all members of the community can feel safe to participate. In order to ensure the Alpha Software Forum is a place where all feel welcome, forum participants are expected to behave as follows:
  • Be professional in your conduct
  • Be kind to others
  • Be constructive when giving feedback
  • Be open to new ideas and suggestions
  • Stay on topic


Be sure all comments and threads you post are respectful. Posts that contain any of the following content will be considered a violation of your agreement as a member of the Alpha Software Forum Community and will be moderated:
  • Spam.
  • Vulgar language.
  • Quotes from private conversations without permission, including pricing and other sales related discussions.
  • Personal attacks, insults, or subtle put-downs.
  • Harassment, bullying, threatening, mocking, shaming, or deriding anyone.
  • Sexist, racist, homophobic, transphobic, ableist, or otherwise discriminatory jokes and language.
  • Sexually explicit or violent material, links, or language.
  • Pirated, hacked, or copyright-infringing material.
  • Encouraging of others to engage in the above behaviors.


If a thread or post is found to contain any of the content outlined above, a moderator may choose to take one of the following actions:
  • Remove the Post or Thread - the content is removed from the forum.
  • Place the User in Moderation - all posts and new threads must be approved by a moderator before they are posted.
  • Temporarily Ban the User - user is banned from forum for a period of time.
  • Permanently Ban the User - user is permanently banned from the forum.


Moderators may also rename posts and threads if they are too generic or do not property reflect the content.

Moderators may move threads if they have been posted in the incorrect forum.

Threads/Posts questioning specific moderator decisions or actions (such as "why was a user banned?") are not allowed and will be removed.

The owners of Alpha Software Corporation (Forum Owner) reserve the right to remove, edit, move, or close any thread for any reason; or ban any forum member without notice, reason, or explanation.

Community members are encouraged to click the "Report Post" icon in the lower left of a given post if they feel the post is in violation of the rules. This will alert the Moderators to take a look.

Alpha Software Corporation may amend the guidelines from time to time and may also vary the procedures it sets out where appropriate in a particular case. Your agreement to comply with the guidelines will be deemed agreement to any changes to it.



Bonus TIPS for Successful Posting

Try a Search First
It is highly recommended that a Search be done on your topic before posting, as many questions have been answered in prior posts. As with any search engine, the shorter the search term, the more "hits" will be returned, but the more specific the search term is, the greater the relevance of those "hits". Searching for "table" might well return every message on the board while "tablesum" would greatly restrict the number of messages returned.

When you do post
First, make sure you are posting your question in the correct forum. For example, if you post an issue regarding Desktop applications on the Mobile & Browser Applications board , not only will your question not be seen by the appropriate audience, it may also be removed or relocated.

The more detail you provide about your problem or question, the more likely someone is to understand your request and be able to help. A sample database with a minimum of records (and its support files, zipped together) will make it much easier to diagnose issues with your application. Screen shots of error messages are especially helpful.

When explaining how to reproduce your problem, please be as detailed as possible. Describe every step, click-by-click and keypress-by-keypress. Otherwise when others try to duplicate your problem, they may do something slightly different and end up with different results.

A note about attachments
You may only attach one file to each message. Attachment file size is limited to 2MB. If you need to include several files, you may do so by zipping them into a single archive.

If you forgot to attach your files to your post, please do NOT create a new thread. Instead, reply to your original message and attach the file there.

When attaching screen shots, it is best to attach an image file (.BMP, .JPG, .GIF, .PNG, etc.) or a zip file of several images, as opposed to a Word document containing the screen shots. Because Word documents are prone to viruses, many message board users will not open your Word file, therefore limiting their ability to help you.

Similarly, if you are uploading a zipped archive, you should simply create a .ZIP file and not a self-extracting .EXE as many users will not run your EXE file.
See more
See less

Summarising text patterns in a compound string

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

    Summarising text patterns in a compound string

    My next challenge is to read through half a million records and analyse the text in one field.
    The field has numerics - which I am not interested in - and text which I need to look into.
    Text can be similar to;
    Genogram,
    CDF Referral
    Third Party Document regarding housing allowance.

    They are actually file names.


    Is there any way to summarise contiguous text blocks containing different text items?
    Contiguous text is what I am after and there will be a shed load, I realised that.
    I can chop and slice once I get the main elements.
    See our Hybrid Option here;
    https://hybridapps.example-software.com/


    Apologies to anyone I haven't managed to upset yet.
    You are held in a queue and I will get to you soon.

    #2
    Re: Summarising text patterns in a compound string

    Actual sample field contents?

    fieldname = "2345347Genogram 9473573 "
    ? rtrim(ltrim(fieldname,"0123456789"),"0123456789")
    = "Genogram"
    There can be only one.

    Comment


      #3
      Re: Summarising text patterns in a compound string

      Problem is Stan, I don't have a handle on the text block.
      How would I put this in a Summarise op?
      See our Hybrid Option here;
      https://hybridapps.example-software.com/


      Apologies to anyone I haven't managed to upset yet.
      You are held in a queue and I will get to you soon.

      Comment


        #4
        Re: Summarising text patterns in a compound string

        Actual sample field contents? I don't have a handle on what you mean.

        Is

        Genogram,
        CDF Referral
        Third Party Document regarding housing allowance.

        in one field? Is this a memo field?
        There can be only one.

        Comment


          #5
          Re: Summarising text patterns in a compound string

          No Stan, each is in a single field.
          There are 500k records with a structure of
          Id/name/text.
          The examples are in separate records.
          It's not a Memo field

          What I am attempting to do is to identify the different words used in the text field.
          The users were given instructions to use certain phrases but they are pretty mixed up.

          A summary of different words would get me started, like

          Genogram 155
          CDF 200
          Referral 566
          Third 122
          Party 124
          Last edited by Ted Giles; 06-11-2015, 03:42 AM.
          See our Hybrid Option here;
          https://hybridapps.example-software.com/


          Apologies to anyone I haven't managed to upset yet.
          You are held in a queue and I will get to you soon.

          Comment


            #6
            Re: Summarising text patterns in a compound string

            Originally posted by Ted Giles View Post
            Problem is Stan, I don't have a handle on the text block.
            How would I put this in a Summarise op?
            if you write a function as follows that will give any alpha string in string containing varied numbers in front or in the back of the string to harvest.

            Code:
            FUNCTION stripN AS C (inputStr AS C )
            	
            	while isalpha(inputStr)=.f.
            		inputStr=right(inputStr,len(inputStr)-1)
            	end while
            	while isalpha(right(inputStr,1))=.f.
            		inputStr=left(inputStr,len(inputStr)-1)
            		end while
            	stripN = inputStr
            END FUNCTION
            ?txt
            = "1093103ksdsdj13031"

            ?stripN(txt)
            = "ksdsdj"
            once it is done would you be able to run the summarize operation?
            thanks for reading

            gandhi

            version 11 3381 - 4096
            mysql backend
            http://www.alphawebprogramming.blogspot.com
            [email protected]
            Skype:[email protected]
            1 914 924 5171

            Comment


              #7
              Re: Summarising text patterns in a compound string

              I will try this G.
              If I can use the logic in a Summarise Condition that would be great.
              Not sure I can though.

              The other problem is differentiating between 1234Text Different text3456
              I'm trying to isolate contiguous text so the above would come out as

              Text 2
              Different 1
              See our Hybrid Option here;
              https://hybridapps.example-software.com/


              Apologies to anyone I haven't managed to upset yet.
              You are held in a queue and I will get to you soon.

              Comment


                #8
                Re: Summarising text patterns in a compound string

                are these
                1234Text
                Different
                text3456
                like in each record
                or one record will have
                1234Text Different text3456
                ?
                thanks for reading

                gandhi

                version 11 3381 - 4096
                mysql backend
                http://www.alphawebprogramming.blogspot.com
                [email protected]
                Skype:[email protected]
                1 914 924 5171

                Comment


                  #9
                  Re: Summarising text patterns in a compound string

                  A summary operation won't do you any good until you have the data in proper form.

                  You need to create a destination table and then a script that writes the text values to the destination

                  open the original table
                  fetch the first record
                  open the destination table
                  begin a while loop
                  extract the text in the first record, rtrim(ltrim(fieldname,"0123456789"),"0123456789") and stripn() do the same thing
                  get the unique words in the text with WORD_UNIQUE()
                  count the unique words with w_count()
                  begin a for....next loop using that count
                  enter records in the destination table for each of the unique words
                  end the for....next loop
                  fetch the next record in the original table
                  end the while loop
                  close both tables

                  Now you can summarize the destination table to get what you want.
                  There can be only one.

                  Comment


                    #10
                    Re: Summarising text patterns in a compound string

                    Thanks for the good advice and options both of you.
                    What I ended up doing was a mixture.
                    I ran an Update procedure to get rid of 0-9, &,%,-, and any spurious characters.
                    Then I created a table with 5 text fields in it and used Word() to pick the first 5 words out and load them into the table.
                    I'd forgotten about Unique, so that is the next part of the operation.
                    As I said, many thanks and just talking about it helped a lot.
                    Ted
                    See our Hybrid Option here;
                    https://hybridapps.example-software.com/


                    Apologies to anyone I haven't managed to upset yet.
                    You are held in a queue and I will get to you soon.

                    Comment


                      #11
                      Re: Summarising text patterns in a compound string

                      If you want the script.

                      Code:
                      o_tbl = table.open("original",FILE_RW_EXCLUSIVE)
                      tbl.fetch_first()
                      d_tbl = table.open("destination",FILE_RW_EXCLUSIVE)
                      while .not. o_tbl.fetch_eof()
                      	wds = rtrim(ltrim(textfieldname,"0123456789"),"0123456789")
                      	uwds = word_unique(wds," ")
                      	cnt = w_count(uwds)
                      	for qx = 1 to cnt
                      		d_tbl.enter_begin()
                      		d_tbl.wordfieldname = word(uwds,qx," ")
                      		d_tbl.enter_end()
                      	next qx
                      	o_tbl.fetch_next()
                      end while
                      o_tbl.close()
                      d_tbl.close()
                      There can be only one.

                      Comment


                        #12
                        Re: Summarising text patterns in a compound string

                        That is very kind of you Stan.
                        My analysis so far has uncovered a load of worms.
                        People have been transposing text.
                        One will put a descripter in as "Assessment Core Meeting"
                        Another as "Core Assessment Meeting
                        They are the same thing.

                        So I will still need to eyeball the results and come up with a taxonomy which works for the client.
                        See our Hybrid Option here;
                        https://hybridapps.example-software.com/


                        Apologies to anyone I haven't managed to upset yet.
                        You are held in a queue and I will get to you soon.

                        Comment


                          #13
                          Re: Summarising text patterns in a compound string

                          You will be able to streamline the process somewhat if you filter for

                          "core" $ fieldname .and. "assessment" $ fieldname

                          for example. This will allow you to see common terms, possibly use copy-paste or an update operation once you are sure the records are truly grouped as you want..

                          Thinking you could update the table with "Core Assessment Meeting" filtering for

                          "core" $ fieldname .and. "assessment" $ fieldname .and. "meeting" $ fieldname

                          ?
                          There can be only one.

                          Comment


                            #14
                            Re: Summarising text patterns in a compound string

                            Taxonomy? oooohh good one Ted!
                            Robin

                            Discernment is not needed in things that differ, but in those things that appear to be the same. - Miles Sanford

                            Comment


                              #15
                              Re: Summarising text patterns in a compound string

                              Originally posted by Stan Mathews View Post
                              You will be able to streamline the process somewhat if you filter for

                              "core" $ fieldname .and. "assessment" $ fieldname

                              for example. This will allow you to see common terms, possibly use copy-paste or an update operation once you are sure the records are truly grouped as you want..

                              Thinking you could update the table with "Core Assessment Meeting" filtering for

                              "core" $ fieldname .and. "assessment" $ fieldname .and. "meeting" $ fieldname

                              ?
                              Thanks Stan. The object of the exercise is to identify the Key Words so I will look into this.
                              See our Hybrid Option here;
                              https://hybridapps.example-software.com/


                              Apologies to anyone I haven't managed to upset yet.
                              You are held in a queue and I will get to you soon.

                              Comment

                              Working...
                              X