1 Attachment(s)
How can i extract text from a word document TextBox
i have been working on word Document files.
each document has same text format.
i m using the code below to extract text from it,
Code:
Dim sFileName As String 'Name of files within the folder will be stored
Dim i As Integer 'This will count the files read***
Dim wrd As Word.Application
Dim wrdCount As Long 'Number of characters within the file
'Get the first file with extension "DOC" ***
sFileName = Dir$("C:\Virology\*.DOC") '***
Do While sFileName <> "" '***
Set wrd = New Word.Application 'Create obj of word application
wrd.Visible = False
wrd.Documents.Open "C:\Virology\" & sFileName
wrdCount = wrd.Documents(1).Characters.Count
Text1.Text = Text1.Text & vbCrLf & i & wrd.Documents(1).Range(0, wrdCount).Text
wrd.Documents(1).Close
Set wrd = Nothing
sFileName = Dir$ '***
i = i + 1
Loop
This works good for all the characters in the document but the problem rises when word textbox appears in the document.
Re: How can i extract text from a word document TextBox
I downloaded your document and created the following two macros in it. When you run abd() it prints the contents of the textboxes in the Immediate window. I guess you'll want to do something about the formatting, but I think this is the start of what you want to do.
VB Code:
Sub abd()
Dim s As Shape
For Each s In ActiveDocument.Shapes
PrintShapeText s
Next
End Sub
Sub PrintShapeText(s As Shape)
Dim gs As Shape
If s.TextFrame.HasText Then
Debug.Print s.TextFrame.TextRange
Else
For Each gs In s.GroupItems
PrintShapeText gs
Next
End If
End Sub